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"MORPHOOBNIC PROTEIN- SPECIFIC CELL SURFACE RECEPTORS 
AND USES THEREFOR" 

Field of the Invention 

This invention relates generally to the field of tissue morphogenesis 
and more particularly to morphogenic protein-specific cell surface receptors. 

5 Background of the Invention . 

Cell differentiation is the central characteristic of tissue 

morphogenesis which initiates during embryogenesis, and continues to various 
degrees throughout the life of an organism in adult tissue repair and 
regeneration mechanisms. The degree of morphogenesis in adult tissue varies 
10 among different tissues and is related, among other things, to the degree of 
cell turnover in a given tissue. 

The cellular and molecular events which govern the stimulus for 
differentiation of cells is an area of intensive research. In the medical and 
veterinary fields, it is anticipated that the discovery of the factor or 
15 factors which control cell differentiation and tissue morphogenesis will 

advance significantly medicine's ability to repair and regenerate diseased or 
damaged mammalian tissues and organs . Particularly useful areas for human and 
veterinary therapeutics include reconstructive surgery and in the treatment of 
tissue degenerative diseases including arthritis, emphysema, osteoporosis, 

20 cardiomyopathy, cirrhosis, degenerative nerve diseases, inflammatory diseases, 
and cancer, and in the regeneration of tissues, organs and limbs. (In this 
and related applications, the terms "morphogenetic" and "morphogenic" are used 
interchangeably.) 

A number of different factors have been isolated in recent years which 
25 appear to play a role in cell differentiation. Recently, a distinct subfamily 
of the "super family" of structurally related proteins referred to in the art 
as the "transforming growth factor-b (TGF-6) superfamily of proteins" have 
been identified as true tissue morphogens. 
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The members of this distinct "subfamily " of true tissue morphogenic 
proteins share substantial amino acid sequence homology within their 
morphogenetically active C-terminal domains (at least 50% identity in the C- 
terrainal 102 amino acid sequence} , including a conserved six or seven 
cysteine skeleton, and share the in vivo activity of inducing tissue-specific 
morphogenesis in a variety of organs and tissues. The proteins apparently 
contact and interact with progenitor cells e.g., by binding suitable cell 
surface molecules, predisposing or otherwise stimulating the cells to 
proliferate and differentiate in a morphogenetically permissive environment. 
These morphogenic proteins are capable of inducing the developmental cascade 
of cellular and molecular events that culminate in the formation of new organ- 
specific tissue, including any vascularization, connective tissue formation, 
and nerve innervation as required by the naturally occurring tissue . The 
proteins have been shown to induce morphogenesis of both bone cartilage and 
bone, as well as periodontal tissues, dentin, liver, and neural tissue, 
including retinal tissue. 

The true tissue morphogenic proteins identified to date include 
proteins originally identified as bone inductive proteins. These include OP- 
1, (osteogenic protein-1, also referred to in related applications as "OP1"), 
its Drosophila homolog, 60A, with which it shares 69% identity in the C- 
terminal "seven cysteine" domain, and the related proteins OP-2 (also referred 
to in related applications as "OP2") and OP-3, both of which share 
approximately 70-75% identity with OP-1 in the C-terminal seven cysteine 
domain, as well as BMPS, BMP 6 and its murine homolog, Vgr-1, all of which 
share greater than 85% identity with OP-1 in the C-terminal seven cysteine 
domain, and the BMP6 Xenopus homolog, Vgl, which shares approximately 57% 
identity with OP-1 in the C-terminal seven cysteine domain. Other bone 

inductive proteins include the CBMP2 proteins (also referred to in the art as 

BMP2 and BMP4) and their Drosophila homolog, DPP. Another tissue morphogenic 
protein is GDF-1 (from mouse) . See, for example, PCT documents US92/01968 and 

US92/07358, the disclosures of which are incorporated herein by reference. 
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As stated above, these true tissue morphogenic proteins are recognized 
in the art as a distinct subfamily of proteins different from other members of 
the TGF-B superfamily in that they share a high degree of sequence identity in 
the C- terminal domain and in that the true tissue morphogenic proteins are 
5 able to induce, on their own, the full cascade of events that result in 
formation of functional tissue, rather than merely inducing formation of 
fibrotic (scar) tissue. Specifically, members of the family of morphogenic 
proteins are capable of all of the following in a morphogenetically permissive 
environment: stimulating cell proliferation and cell differentiation, and 

10 supporting the growth and maintenance of differentiated cells. The 

morphogenic proteins apparently may act as endocrine, paracrine or autocrine 
factors. 

The morphogenic proteins are capable of significant species 
"crosstalk . * That is, xenogenic (foreign species) homologs of these proteins 
15 can substitute for one another in functional activity. For example, DPP and 
60A, two Drosophila proteins, can substitute for their mammalian homologs, 
BMP2/4 and OP-1, respectively, and induce endochondral bone formation at a 
non-bony site in a standard rat bone formation assay. Similarly, BMP2 has 

been shown to rescue a dpp“ mutation in Drosophila. In their native form, 

20 however, the proteins ' appear to be tissue-specific, each protein typically 

being expressed in or provided to one or only a few tissues or, alternatively, 
expressed only at particular times during development. For example, GDF-1 
appears to be expressed primarily in neural tissue, while OP-2 appears to be 
expressed at relatively high levels in early (e.g., 8-day) mouse embryos. The 
25 endogenous morphogens may be synthesized by the cells on which they act, by 

neighboring cells, or by cells of a distant tissue, the secreted protein being 
transported to the cells to be acted on. 

A particularly potent tissue morphogenic protein is OP-1 . This 
protein, and its xenogenic homologs, are expressed in a number of tissues, 

30 primarily in tissues of urogenital origin, as well as in bone, mammary and 
salivary gland tissue, reproductive tissues, and gastrointestinal tract 
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tissue. It is also expressed in different tissues during embryogenesis , its 
presence coincident with the onset of morphogenesis of that tissue. 

The morphogenic protein signal transduction across a cell membrane 
appears to occur as a result of specific binding interaction with one or more 
5 cell surface receptors. Recent studies on cell surface receptor binding of 

various members of the TGF-fi protein superfamily suggests that the ligands can 
mediate their activity by interaction with two different receptors, referred 
to as Type I and Type II receptors to form a hetero-complex. A cell surface 
bound beta-glycan also may enhance the binding interaction. The Type I and 
10 Type II receptors are both serine /threonine kinases, and share similar 

structures: an intracellular domain that consists essentially of the kinase, 

a short, extended hydrophobic sequence sufficient to span the membrane one 
time, and an extracellular domain characterized by a high concentration of 
conserved cysteines. 

15 A number of Type II receptor sequences recently have been identified. 

These include "TGF-ER II ", a TGF-fi Type II receptor (Lin et al. (1992) Cell 
68:775-785); and numerous activin-binding receptors. See, for example, 

Mathews et al . (1991) Cell £5:973-982 and international patent application WO 

92/20793, published November 26, 1992, disclosing the "ActR II” sequence; 

20 Attisano et al., (1992) Cell 68:97-108, disclosing the "ActR-IIB" sequence; 
and Legerski et al . (1992) Biochem Biophys. Res. Commun 183 : 672-679 . A 

different Type II receptor shown to have affinity for activin is Atr-II 
(Childs et al.(1993) PNAS 90:9475-9479.) Two Type II receptors have been 
identified in C. elegans , the daf-1 gene, (Georgi et al . (1990) Cell 61 : 635- 
25 645), having no known ligand to date, and daf-4, which has been shown to bind 

BMP 4 , but not activin or TGF-S (Estevez, et al. (1993) Nature 365 :644-649. ) 

Ten Dijke et al . disclose the cloning of six different Type I cell 
surface receptors from murine and human cDNA libraries. ((1993) Oncogene 
8:2879-2887, and Science (1994) 264 : 101-104 . These receptors, referenced as 
30 ALK-1 to ALK-6 ("activin receptor-like kinases"), share significant sequence 
identities (60-79%) and several have been identified as TGF-S binding (ALK-5) 
or activin binding (ALK-2, ALK-4) receptors. Xie et al . also report a 
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Drosophila Type I receptor encoded by the sax gene ( Science (1994) 263 : 1756- 
1759) . The authors suggest that the protein binds DPP. 

•To date , the Type I receptors with which the morphogenic proteins 
described herein interact on the cell surface have not yet been identified, 

5 and no Type II receptor has been described as having binding affinity for OP-1 
and its related sequences. Identification of these cell surface molecules, 
with which the morphogens interact and through which they may mediate their 
biological effect, is anticipated to enhance elucidation of the molecular 
mechanism of tissue morphogenesis and to enable development of morphogen 
10 receptor binding "analogs", e.g., compounds (which may or may not be amino 
acid-based macromolecules) capable of mimicing the binding affinity of a 
morphogen for its receptor sufficiently to act either as a receptor binding 
agonist or antagonist. These "analogs" have particular utility in 
therapeutic, diagnostic and experimental research applications. 

15 It is an object of this invention to provide nucleic acid molecules and 

amino acid sequences encoding morphogenic protein binding cell surface 
receptors, particularly OP-l-specif ic binding receptor sequences . Another 
object is to provide methods for identifying genes in a variety of species 
and/or tissues, and in a variety of nucleic acid libraries encoding 
20 morphogenic protein binding receptors, particularly receptors that bind OP-1. 
Still another object is to provide means for designing biosynthetic receptor- 
binding ligand analogs, particularly OP-1 analogs, and/or for identifying 
natural-occurring ligand analogs, including agonists and antagonists, using 
the receptor molecules described herein, and analogs thereof. Another object 
25 is to provide antagonists, including soluble receptor constructs comprising 
the extracellular ligand-binding domain, which can modulate the availability 
of OP1 for receptor binding in vivo . Another obect is to provide means and 
compositions for competing with activin-receptor and BMP2/ 4 -receptor 
interactions. Yet another object is to provide means and compositions for 
30 ligand affinity purification and for diagnostic detection and quantification 
of ligands in a body fluid using OPl-specific cell surface receptors and 
ligand-binding fragments thereof. Still another object is to provides means 
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and compositions for modulating the endogenous expression or concentration of 
these receptor molecules. Yet another object is to provide ligand-receptor 
complexes and analog sequences thereof, as well as antibodies capable of 
identifying and distinguishing the complex from its component proteins. Still 
5 another object is to provide means and compositions for modulating a 

morphogenesis in a mammal. These and other objects and features of the 
invention will be apparent from the description, drawings and claims which 
follow. 

Summary of the Invention 

10 Type 1 and T yP e 11 cel l surface receptor molecules capable of specific 

binding affinity with true tissue morphogenic proteins, particularly OP-1- 
related proteins, now have been identified. Accordingly, the invention 
provides ligand-receptor complexes comprising at least the ligand binding 
domain of these receptors and OP-1 or an OP-1 receptor-binding analog as the 
15 ligand; means for identifying and/or designing useful OP-1 receptor-binding 
analogs and OP-l-binding- receptor analogs; and means for modulating the 
tissue morphogenesis capability of a cell. 

The morphogen cell surface receptors useful in this invention are 
referred to in the art as Type I or Type II serine/threonine kinase receptors. 
20 They share a conserved structure, including an extracellular, ligand-binding 
domain generally composed of about 100-130 amino acids (Type I receptors; up 
to about 196 amino acids for Type II receptors) , a transmembrane domain 
sufficient to span a cellular membrane one time, and an intracellular 
(cytoplasmic) domain having serine/ threonine kinase activity. The intact 
25 receptor is a single polypeptide chain of about 500-550 amino acids and having 
an apparent molecular weight of about 50-55 kDa. 

Of particular utility in the methods and compositions of the invention 
are the Type I cell surface receptors referenced herein and in the literature 
as, ALK-2 , ALK-3 and ALK-6 , whose nucleic acids and encoded amino acid 
30 sequences are represented by the sequences in Seq. ID Nos. 3, 5 and 7 

respectively, and which, as demonstrated herein below, have specific binding 
affinity for OP1 and OPl-related analogs. Accordingly, in one embodiment, the 
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receptor sequences contemplated herein include OP-1 binding analogs of the 
ALK-2 , ALK-3 and ALK-6 proteins described herein. 

As used herein, ligand-receptor binding specificity is understood to 
mean a specific, saturable noncovalent interaction between the ligand and the 
5 receptor, and which is subject to competitive inhibition by a suitable 

competitor molecule. Preferred binding affinities (defined as the amount of 
ligand required to fill one-half (50%) of available receptor binding sites) 
are described herein by dissociation constant (Kd) . In one embodiment, 
preferred binding affinities of the ligand-receptor complexes described herein 
10 have a Kd of less than 10“ 7 M, preferably less than 5xlO“ 7 M, more preferably 

less than 10 ~®M. In another preferred embodiment, the receptor molecules have 
little or no substantial binding affinity for TGF-E. 

As used herein, an "OPl-specif ic receptor analog" is understood to mean a 
sequence variant of the ALK-2, ALK-3 or ALK-6 sequences which shares at least 
15 40%, preferably at least 45%, and most preferably at least 50%, amino acid 

identity in the extracellular ligand binding domain with the sequence defined 
by residues 23-122 of Seq. ID No. 7 (ALK-6) , and which has substantially the 
same binding affinity for 0P1 as ALK-2, ALK-3 or ALK-6. ALK-6 and ALK-3 share 
46% amino acid sequence identity in their ligand binding domains. 

20 Accordingly, in one preferred embodiment, the OP1 -specific receptor analogs 

share at least 46% amino acid sequence identity with the extracellular, ligand 
binding domains of ALK-6 or ALK-3 . 

As will be appreciated by those having ordinary skill in the art, OPl- 
specif ic receptor analogs also can have binding affinity for other, related 
25 morphogenic proteins . As used herein, an OPl-specif ic receptor analog is 

understood to have substantially the same binding affinity for OP-1 as ALK-2, 
ALK-3 or ALK-6 if it can be competed successfully for OP-1 binding in a 
standard competition assay with a known OP-1 binding receptor, e.g., with ALK- 
2, ALK-3 or ALK-6. In one preferred embodiment, OPl-specif ic receptor analogs 
have a binding affinity for OP-1 defined by a dissociation constant of less 

than about 10~ 7 M, preferably less than about 5 x 10~ 7 M or 10"® M. It is 



30 
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anticipated however, that analogs having lower binding affinities, e.g., on 

the order of 10“ 6 M also will be useful. For example, such analogs may be 
provided to an animal to modulate availability of serum-soluble 0P1 for 
receptor binding in vivo . Similarly, where tight binding interaction is 
5 desired, for example as part of a cancer therapy wherein the analog acts as a 
ligand-receptor antagonist, preferred binding affinities may be on the order 
of 5xlO“®M. 

In another embodiment, the OP-1 binding receptor analogs contemplated 
by the invention include proteins encoded by nucleic acids which hybridize 
10 with the DNA sequence encoding the extracellular, ligand binding domain of 

ALK-2 , ALK-3 or ALK-6 under stringent hybridization conditions, and which have 
substantially the same OP-1 binding affinity as ALK-2, ALK-3 or ALK-6. As 
used herein, stringent hybridization conditions are as defined in the art, 

(see, for example, Molecular Cloning: A Laboratory Manual , Maniatis et al . , 

15 eds. 2d.ed. , Cold Spring Harbor Press, Cold Spring Harbor, 1989.) An 

exemplary set of conditions is defined as: hybridization in 4 0% formamide, 5 

X SSPE , 5 X Denhardt’s Solution, and 0.1% SDS at 37°C overnight, and washing 
in 0.1 X SSPE, 0.1% SDS at 50°C. 

In still another embodiment, the OP-1 binding receptor analogs 
20 contemplated by the invention include part or all of a serine/ threonine kinase 
receptor encoded by a nuclei c acid that can be amplified with one or more 
primers derived from ALK-1 (Seq. ID No. 1), ALK-2,* ALK-3 or ALK-6 sequence in 
a standard PCR (polymerase chain reaction) amplification scheme. In 
particular, a primer or, most preferably, a pair of primers represented by any 
25 of the sequences of SEQ ID Nos. 12-15 are envisioned to be particularly 
useful. Use of primer pairs (e.g., SEQ. ID No. 12/15; 13/15; 14/15) are 
described in WO94/11502 (PCT/GB93/02367) . 

Useful OPl-specific receptor analogs include xenogenic (foreign 
species) homologs of the murine and human ALK sequences described herein, 

30 including those obtained from other mammalian species, as well as other, 
eukaryotic, non-mammalian xenogenic homologs. Also contemplated are 
biosynthetic constructs and naturally-occurring sequence variants of ALK-2, 
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ALK-3 and ALK-6, provided these molecules, in all cases, share the appropriate 
identity in the ligand binding domain, and bind OP-1 specifically as defined 
herein. In one embodiment, sequence variants include receptor analogs which 
have substantially the same binding affinity for OP1 as ALK-2, ALK-3 or ALK-6 
5 and which are recognized by an antibody having binding specificity for ALK-2, 
ALK-3 or ALK-6. 

In another embodiment the receptors and OP-1 binding receptor analogs 
contemplated herein provide the means by which a morphogen, e.c., OP-1, can 
mediate a cellular response. In one embodiment these receptors include ALK-2, 
10 ALK-3, or ALK-6, or sequence variants or OP-1 binding analogs thereof. In 

another embodiment, ALK-1 , including sequence variants thereof is contemplated 
to participate in an OP-1 mediated cellular response. 

OPl-specific receptor analogs may be used as OPl antagonists. For 
example, a soluble form of a. receptor, e.g., consisting essentially of only 
15 the extracellular ligand-binding domain, may be provided systemically to a 

mammal to bind to soluble ligand, effectively competing with ligand binding to 
a cell surface receptor, thereby modulating (reducing) the availability of 
free ligand in vivo for cell surface binding. 

The true tissue morphogenic proteins contemplated as useful receptor 
20 • ligands in the invention include OP-1 and OP-1 receptor-binding analogs. As 

used herein, an "OP-1 analog* 1 or *OP-l receptor-binding analog" is understood 
to include all molecules able to functionally substitute for OP-1 in Type I 
receptor binding, e.g., are able to successfully compete with OP-1 for 
receptor binding in a standard competition assay. In one embodiment, useful 
25 OP-1 receptor-binding analogs include molecules whose binding affinity is 

defined by a dissociation constant of less than about 5x 10"^M, preferably 
less than about 10 _7 M or 5x 10‘ 7 M. As for the OP-specific receptor analogs 
above, both stronger and weaker binding affinities are contemplated to be 
useful in particular applications. In one preferred embodiment, these 
30 : receptor-binding OP-1 analogs also bind OP-1 specific Type II serine/kinase 



receptors . 
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The OP-1 analogs contemplated herein, all of which mimic the binding 
activity of OP-1 or an OP-l-related protein sufficiently to act as a 
substitute for OP-1 in receptor binding, can act as OP-1 agonists, capable of 
mimicking OP-1 both in receptor binding and in inducing a transmembrane effect 
5 e.g., inducing threonine or serine-specific phosphorylation following binding. 

Alternatively, the OP-1 analog can act as an OP-1 antagonist, capable of 
mimicking OP-1 in receptor binding only, but unable to induce a transmembrane 
effect, thereby blocking the natural ligand from interacting with its 
receptor, for example. Useful applications for antagonists include their use 
10 as therapeutics to modulate uncontrolled differentiated tissue growth, such as 
occurs in malignant transformations such as in osteosarcomas or Paget's 
disease. 

OP-1 analogs contemplated by the invention can be amino acid-based, 
e.g., sequence variants of OP-1, or antibody-derived sequences capable of 
15 functionally mimicking OP-1 binding to an OP-l-specif ic receptor. Examples of 
such antibodies may include . anti-idiotypic antibodies. In a specific 
embodiment, the anti-idiotypic antibody mimics OPl both in receptor binding 
and in ability to induce a transmembrane effect. Alternatively, the OP-1 
analogs can be composed in part or in whole of other chemical structures, 

20 e.g., the analogs can be comprised in part or in whole of nonproteinaceous 

molecules. In addition, the OP-1 analogs contemplated can be naturally 
sourced or synthetically produced. 

As used herein, OP-1 related sequences include sequences, sharing at 
least 60%, preferably greater than 65% or even 70% identity with the C- 
25 terminal 102 amino acid sequence of OP-1 as defined in Seq ID NO. 7, and which 
are able to substitute for OP-1 in ligand binding to ALK-2 , ALK-3 or ALK-6, 
(e.g., able to compete successfully with OP-1 for binding to one or more of 
these receptors.) OP-1 related sequences contemplated by the invention 
include xenogenic homologs (e.g., the Drosophila homolog 60A) , and the related 
30 sequences referenced herein and in the literature as OP-2, OP-3, BMPS, BMP 6 
(and its xenogenic homolog Vgr-1.) OP-1 related sequences also include 
sequence variants encoded by a nucleic acid which hybridizes with a DNA 
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sequence comprising the C- terminal 102 amino acids of Seq. ID No. 9 under 
stringent hybridization conditions and which can substitute for OP1 in an OP1- 
receptor binding assay. In another embodiment, an OP1 sequence variant 
includes a protein which can substitute for OP1 in a ligand-receptor binding 
5 assay and which is recognized by an antibody having binding specificity for 
OP1. 

As used herein, "amino acid sequence homology" is understood to mean 
amino acid sequence similarity, and homologous sequences sharing identical or 
similar amino acids, where similar amino acids are conserved amino acids as 
10 defined by Dayoff et al . , Atlas of Protein Sequence and Structure ; vol.5, 
Suppl.3, pp. 345-362 (M.O. Dayoff, ed., Nat 1 1 BioMed. Research Fdn. , 

Washington D.C. 1978.) Thus, a candidate sequence sharing 60% amino acid 
homology with a reference sequence requires that, following alignment of the 
candidate sequence with the reference sequence, 60% of the amino acids in the 
15 candidate sequence are identical to the corresponding amino acid in the 
reference sequence, or constitute a conserved amino acid change thereto. 

"Amino acid sequence identity" is understood to require identical amino acids 
between two aligned sequences. Thus, a candidate sequence sharing 60% amino 
acid identity with a reference sequence requires that, following alignment of 
2 0 the candidate sequence with the reference sequence, 60% of the amino acids in 

the candidate sequence are identical to the corresponding amino acid in the 
reference sequence. 

As used herein, all receptor homologies and identities calculated use 
ALK-6 as the reference sequence, with the extracellular domain reference 
25 sequence constituting residues 23-122 of Seq. ID No. 7; and the intracellular 
serine/threonine kinase domain reference sequence constituting residues 206- 
495 of Seq. ID No. 7. Similarly, all OP-1 related protein homologies and 
identities use OP-1 as the reference sequence, with the C- terminal 102 amino 
acids described in Seq. ID No. constituting the seven cysteine domain. 

30 Also as used herein, sequences are aligned: for homology and identity 

calculations as follows: Sequences are aligned by eye to maximize sequence 

identity. Where receptor amino acid extracellular domain sequences are 
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compared, the alignment first maximizes alignment of the cysteines present in 
the two sequences, then modifies the alignment as necessary to maximize amino 
acid identity and similarity between the two sequences. Where amino acid 
intracellular domain sequences are compared, sequences are aligned to maximize 
5 alignment of conserved amino acids in the kinase domain, where conserved amino 
acids are those identified by boxes in Fig. 3. The alignment then is modified 
as necessary to maximize amino acid identity and similarity. In all cases, 
internal gaps and amino acid insertions in the candidate sequence as aligned 
are ignored when making the homology/ identity calculation. Exemplary 
10 alignments are illustrated in Figs. 2 and 3 where the ammo acid sequences for 
the extracellular and intracellular domains, respectively are presented in 
single letter format . In the figures “gaps" created by sequence alignment are 
indicated by dashes. 

In one aspect, the invention contemplates isolated ligand-receptor 
15 complexes comprising OP-1 or an OP-1 analog as the ligand in specific binding 
interaction with an OP-1 binding Type X receptor or receptor analog, as 
defined herein. In another aspect, the invention contemplates the ligand- 
receptor complex comprises part or all of an OP-1 binding Type II receptor. 
Type II receptors contemplated to be useful include Type II receptors defined 
20 in the literature (referenced hereinabove) as having binding specificity for ' 
activin or a bone morphogenic protein such as BMP-4 . Such Type II receptors 
include daf4, ActRII and Atrll . In still another aspect, the ligand-receptor 
complex comprises both a Type 1 and a Type II receptor and OP1, or an OP1 
analog as the ligand. In all complexes, the bound receptor can comprise just 
25 the extracellular, ligand binding domain, or car. also include part or all of 
the transmembrane sequence, and/or the intracellular kinase domain. 

Similarly, the OP-1 ligand may comprise just the receptor binding sequence, 
longer sequences , including the mature dimeric species or any soluble form of 
the protein or protein analog. 

30 The OP-1 ar.d OP-1 analogs described herein can interact specifically 

with Type I and Type II receptors also known to interact with other 
morphogenic proteins (e.g., BMP2/BMP4 ) and activin. Thus invention also 
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contemplates the use of OP-1 and OP-1 receptor-binding analogs as competitors 
of specific BMP-receptor and activin-receptor interactions. As will be 
appreciated by those having ordinary skill in the art, these binding 
competitors may act as either agonists or antagonists (e.g., to inhibit an 
5 activin or BMP-mediated cellular response) . 

In another aspect, the invention contemplates binding partners having 
specific binding affinity for an epitope on the ligand-receptor complex. In a 
preferred embodiment, the binding partner can discriminate between the complex 
and the uncoraplexed ligand or receptor. In another embodiment, the binding 
10 partner has little or no substantial binding affinity for the uncomplexed 

ligand or receptor. In another preferred embodiment, the binding partner is a 
binding protein, more preferably an antibody. These antibodies may be 
monoclonal or polyclonal, may be intact molecules or fragments thereof (e.g.. 
Fab, Fab', (Fab) * 3 ), or may be biosynthetic derivatives, including, but not 
15 limited to, for example, monoclonal fragments, such as single chain F v 

fragments, referred to in the literature as sF v s, BABs and SCAs , and chimeric 
monoclonals, in which portions of the monoclonals are humanized (excluding 
those portions involved in antigen recognition (e.g., complementarity 
determining regions, "CDRs".) See, for example, U.S. Pat. Nos. 5,091,513 and 
20 5,132,405, the disclosures of which are incorporated herein by reference. 

Biosynthetic chimeras, fragments and other antibody derivatives may be 
synthesized using standard recombinant DNA methodology and/or automated 
chemical nucleic acid synthesis methodology well described in the art and as 
described below. 

25 In still another aspect, the invention provides molecules useful in the 

design and/or identification of receptor-binding morphogenic protein analogs 
as described below, as well as kits and methods, e,g., screening assays, for 
identifying these analogs. The molecules useful in these assays can include 
part or all of the receptor sequence of SEQ ID NO. 3, 5 or 7 , including amino 

30 acid sequence variants and OP-1 binding analogs and amino acid sequence 



variants thereof. 
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As described above, sequence variants are contemplated to have 
substantially the same binding affinity for OP-1 as the receptors represented 
by the sequences in SEQ. ID Nos. 3-7. OP-1 binding receptor analogs include 
other, known or novel Type I or Type II serine/ threonine kinase receptors 
5 having binding affinity and specificity for OP-1 as defined herein and which 
(1) share at least 40% amino acid identity with residues 23-122 of Seq. ID No. 
7, (2) are encoded by a nucleic acid that hybridizes under stringent 

conditions with a nucleic acid comprising the sequence defined by nucleotides 
256-552 of Seq ID No. 7; or (3) are encoded by a nucleic acid obtainable by 
10 amplification with one or more primer sequences defined by Seq. ID Nos. 12-15. 
Currently preferred for the assays of the invention are receptor sequences 
comprising at least the sequence which defines the extracellular, ligand 
binding domains of these proteins. The kits and assays may include just Type 
I receptors or both Type I and Type II receptors. Similarly, the kits and 
15 screening assays can be used in the design and/or identification of OP1- 

specific receptor analogs. The OP-1 receptor-binding analogs and OP-l-binding 
receptor analogs thus identified then can be produced in reasonable quantities 
using standard recombinant expression or chemical synthesis technology well 
known and characterized in the art. Alternatively, promising candidates can 
20 be modified using standard biological or chemical methodologies to, for 

example, enhance the binding affinity of the candidate analog as described in 
Example 10, below, and the preferred candidate derivative then produced in 
quantity. 

In still another aspect, the receptor and/or OPl-specific receptor 
25 analogs can be used in standard methodologies for affinity purifying and/or 
quantifying OP1 and 0P1 analogs. For example, the receptor’s ligand binding 
domain first may be immobilized on a surface of a well or a chromatographic 
column; ligand in a sample fluid then may be provided to the receptor under 
conditions to allow specific binding; non-specific binding molecules then 
30 removed, e.g., by washing, and the bound ligand then selectively isolated 

and/or quantitated. Similarly, OP1 and OPl analogs can be used for affinity 
purifying and/or quantifying OPl-specific receptors and receptor analogs. In 
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one embodiment, the method is useful in kits and assays for diagnostic 
purposes which detect the presence and/or concentration of OP1 protein or 
related morphogen in a body fluid sample including, without limitation, serum, 
peritoneal fluid, spinal fluid, and breast exudate. The kits and assays also 
5 can be used for detecting and/or quantitating OP-l-specif ic receptors in a 
sample . 

In still another aspect the invention comprises OPl-specific receptors 
and OP-l-binding receptor analogs useful in screening assays to identify 
organs, tissues and cell lines which express OPl-specific receptors. These 
10 cells then can be used in screening assays to identify ligands that modulate 
endogenous morphogen receptor expression levels, including the density of 
receptors expressed on a cell surface. Useful assay methodologies may be 
modeled on those described in PCT US92/07359, and as described below. 

The invention thus relates to compositions and methods for the use of 
15 morphogen-specific receptor sequences in diagnostic, therapeutic and 

experimental procedures. Active receptors useful in the compositions and 
methods of this invention can include truncated or full length forms, as well 
as forms having varying glycosylation patterns. Active receptors useful in 
the invention also include chimeric constructs as described below. Active 
20 OPl-specific receptors /analogs can be expressed from intact or truncated 
genomic or cDNA, or from synthetic DNAs in procaryotic or eucaryotic host 
cells, and purified, cleaved, refolded and oxidized as necessary to form 
active molecules. Useful host cells include prokaryotes, including coli 
and B. subtilis , and eukaryotic cells, including mammalian cells, such as 
25 fibroblast 3T3 cells, CHO, COS, melanoma or BSC cells, Hela and other human 
cells, the insect /baculovirus system, as well as yeast and other microbial 
host cell systems. 

Thus, in view of this disclosure, skilled genetic engineers now can, 
for example, identify and produce OPl-specific cell surface receptors or 
30 analogs thereof; create and perform assays for screening candidate OP1 

receptor-binding analogs and evaluate promising candidates and their progency 
in therapeutic regimes and preclinical studies; modulate the availability of 
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endogenous morphogen for cell surface interactions; modulate endogenous 
morphogen-specific cell surface receptor levels; elucidate the signal 
transduction pathway induced by morphogen-cell surface receptor binding; and 
modulate tissue morphogenesis in vivo. 

5 Brief Description of the Drawings 

Figure 1 is a schematic representation of the encoded ALK-2 , ALK-3, 
ALK-6 amino acid sequences, showing the signal sequence 10, the transmembrane 
domain 12, the extracellular ligand binding domain 14, and the intracellular 
serine/ threonine kinase domain 16; 

10 Figure 2 is a homology alignment of the extracellular domains of ALK-2, 

ALK-3, and ALK-6, aligned to maximize amino acid identity, wherein conserved 
amino acids are identified by boxes and conserved cysteines are indicated by 
asterisks; and 

Figure 3 is a homology alignment of the intracellular domain of ALK-2, 
15 ALK-3 and ALK-6, aligned to maximize amino acid identity, wherein conserved 
amino acids are boxed and the serine /threonine kinase domain is indicated by 
arrows . 

Detailed Description 

Disclosed herein are Type I and Type II receptors having binding 
20 specificity for true tissue morphogenic proteins, particularly OP1 and OP1- 

related proteins. It further has been determined that OP1 binds to a broader 
range of receptors than. other known tissue morphogens or TGF-& family members. 
The Type I receptors disclosed herein, can be used together with OP1 and OP1 
analogs for therapeutic, diagnostic and experimental uses as described herein 
25 below. Moreover, soluble forms of the OPl-binding receptor proteins, e.g., 

forms consisting essentially of the extracellular domain or a fragment thereof 
sufficient to bind OP1 with specificity, can be used as a soluble therapeutic 
morphogen antagonist, as described below. 

Following this disclosure, related OPl-specific receptors are 
30 . available, as are high and medium flux screening assays for identifying OP1 
analogs and OPl-specific receptor analogs. These analogs can.be naturally 
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occurring molecules, or they can be designed and biosynthetically created 
using a rational drug design and an established structure/function analysis. 
The analogs can be amino acid-based or can be composed in part or whole of 
non-proteinaceous synthetic organic molecules. Useful analogs also can 
5 include antibodies, preferably monoclonal antibodies (including fragments 

thereof, e.g.. Fab, Fab', and (Fab)^)# or synthetic derivatives thereof, such 

as monoclonal single chain F v fragments known in the art as sF v s, BABs, and 

SCAs (see below), and bispecific antibodies or derivatives thereof. When 
these antibodies mimic the binding activity of OP-1 to a cell surface receptor 
10 without inducing the biological response . OP-1 does upon binding, the antibody 
can compete for OP-1 binding and act as an antagonist. These antibodies or 
derivatives thereof also can mimic OP-1 both in receptor binding and signal 
transduction, in which case the antibody acts as an OP-1 agonist. The 
antibodies and derivatives also can be used for inducing the morphogenic 
15 cellular response by crosslinking receptors to morphogenic proteins, 

particularly OP1 and OPl-related proteins to form either homo- or hetero- 
complexes of the Type I and Type II receptors. 

The OPl-binding receptor sequences described herein (ALK2 , ALK3, ALK6 ) 
also can be used to create chimeric sequences, wherein, for example, part or 
20 all of either the extracellular domain or the intracellular domain is a non- 
ALK sequence or is created from two or more ALK sequences . These chimeric 
receptors can be synthesized using standard recombinant DNA methodology 
and/or automated chemical nucleic acid synthesis methodology well described in 
the art and as disclosed below. Chimerics can be used, for example, in OP1 
25 analog assays, wherein the OPl-binding extracellular domain is coupled to a 
non-ALK intracellular domain that is well characterized and/or readily 
detectable as a second messenger response system, as described below. 

Chimerics also can be used, for example, in high flux OPl analogs screens and 
as part of purification protocols, wherein a soluble ligand binding domain of 
30 an OPl-specific receptor is immobilized onto a support e.g., by covalent or 
non-covalent interactions, with a chromatographic matrix or the well surface 
of a 96-well plate. When immobilized onto a chromatographic matrix surface. 
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the receptor fragment can be used in a protocol to isolate OP1 or OP1 
analogs. When immobilized on a well surface the receptor fragment is 
particularly useful in a screening assay to identify receptor-binding OP1 
analogs in a standard competition assay. 

5 The true tissue morphogenic proteins contemplated to be useful in the 

methods and compositions of the invention include forms having varying 
glycosylation patterns and varying N-termini. The proteins can be naturally 
occurring or biosynthetically derived, and can be produced by expression of 
recombinant DNA in prokaryotic or eukaryotic host cells. The proteins are 
10 active as a single species (e.g., as homodimers), or combined as a mixed 

species. Useful sequences and eucaryotic and procaryotic expression systems 
are well described in the art. See, for example, US Patent Nos. 5,061,911 and 
5,266,683 for useful expression systems. 

Particularly contemplated herein are OP1 and OPl-related sequences. 

15 Useful OP1 sequences are recited in US Pat Nos. 5,011,691; 5,018,753 and 

5,266,683; in Ozkaynak et al. (1990) EMBO J 9:2085-2093; and Sampath et al . 
(1993) PNAS 90: 6004-6008. OP-1 related sequences include xenogenic homologs, 
e.g.; 60A, from Drosophila, VJharton et al . (1991) PNAS ^8:9214-9218; and 

proteins sharing greater than 60% identity with OP1 in the C-terminal seven 
20 cysteine domain, preferably at least 65% identity. * Examples of OP-1 related 
sequences include BMP5 , BMPS (and its species homolog Vgr-1 , Lyons et al . 

(1989) PNAS 86:4554-4558), Celeste, et al . (1990) PNAS 87:9843-9847 and PCT 

international application W093/00432; OP-2 (Ozkaynak et al . (1992) J.Biol. 

Chem. 267 : 13198-13205 ) and OP-3 (PCT international application WO94/06447) . 

2 5 As will be appreciated by those having ordinary skill in the art, chimeric 
constructs readily can be created using standard molecular biology and 
mutagenesis techniques combining various portions of different morphogenic 
protein sequences to create a novel sequence, and these forms of the protein 
also are contemplated herein. 

30 A particularly preferred embodiment of the proteins contemplated by the 

invention includes proteins whose amino acid sequence in the cysteine-rich C- 
terminal domain has greater than 60% identity, and preferably greater than 65% 
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identity with the amino acid sequence of OPS (OP-1 sequence defining the C- 
terminal conserved six cysteines, e.g., residues 335-431 of Seq. ID No. 9). 

In another preferred aspect, the invention contemplates osteogenic 
proteins comprising species of polypeptide chains having the generic amino 
5 acid sequence herein referred to as "OPX" which accommodates the homologies 

between the various identified species of the osteogenic OP1 and OP2 proteins, 
and which is described by the amino acid sequence presented below and in 
Sequence ID No. 11. 
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and wherein Xaa at res. 2 = (Lys or Arg); Xaa at res. 3 = (Lys or Arg) ; Xaa at 
res. 11 = (Arg or Gin); Xaa at res. 16 = (Gin or Leu); Xaa at res. 19 = (lie 
or Val) ; Xaa at res. 23 = (Glu or Gin}; Xaa at res. 26 = (Ala or Ser); Xaa at 

res. 35 = (Ala or Ser); Xaa at res. 39 = (Asn or Asp); Xaa at res. 41 = (Tyr 

35 or Cys); Xaa at res. 50 = (Val or Leu); Xaa at res. 52 = (Ser or Thr) ? Xaa at 

res. 56 = (Phe or Leu); Xaa at res. 57 = (lie or Met); Xaa at res. 58 = (Asn 

or Lys); Xaa at res. 60 = (Glu, Asp or Asn); Xaa at res. 61 = (Thr, Ala or 

Val); Xaa at res. 65 = (Pro or Ala); Xaa at res. 71 = (Gin or Lys); Xaa at 

res. 73 = (Asn or Ser); Xaa at res. 75 = (lie or Thr); Xaa at res. 80 = (Phe 

or Tyr); Xaa at res. 82 = (Asp or Ser); Xaa at res. 84 = (Ser or Asn); Xaa at 

res. 89 = (Lys or Arg); Xaa at res. 91 = (Tyr or His); and Xaa at res. 97 = 

(Arg or Lys) . 
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In still another preferred aspect, the invention contemplates 
osteogenic proteins encoded by nucleic acids which hybridize to DMA or RNA 
sequences encoding the C-terminal seven cysteine domain of OP1 or OP2 under 
stringent hybridization conditions. 

5 A brief description of the various terms of OP-1 useful in the 

invention is described below: 

OP1 - Refers generically to the family of osteogenically active 

proteins produced by expression of part or all of the hOPl gene. Also referred 
to in related applications as "OPI" and "OP-1". 

10 OP1-PP - Amino acid sequence of human OPI protein (prepro form), Seq. ID 

No. 9, residues 1-431. Also referred to in related applications as "OP1-PP" 
and "OPP*. 

OPl-18Ser - Amino acid sequence of mature human OPI protein, Seq. ID No. 9, 
residues 293-431. N-terminal amino acid is serine. Originally identified as 
15 migrating at 18 kDa on.SDS-PAGE in COS cells. Depending on protein 

glycosylation pattern in different host cells, also migrates at 23kDa, 19kDa 
and 17kDa on SDS-PAGE. Also referred to in related applications as "OP1-18." 

OPl-16Ser ; OPl-16Ala; OP1-16 Met; OP1-16 leu; OPl-16Val - N-terminally 
truncated mature human OPI protein species defined, respectively, by residues 
20 300-431; 316-431; 315-431; 313-431 and 318-431. 

OPS - Amino acid sequence defining the C-terminal six cysteine domain, 
residues 335-431 of Seq. ID No. 9. 

OP7 - Amino acid sequence defining the C-terminal seven cysteine domain, 
residues 330-431 of Seq. ID No. 9. 

25 Soluble form OPI - mature dimeric OPI species having one or, preferably two 
copies of pro domain, e.g., at least residues 158-292 of. Seq. ID No. 9, 
preferably residues 48-292 or 30-292, non-covalently complexed with the 



dimer. 
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The cloning procedure for obtaining OPl-binding ALK nucleic acid 
sequences, means for expressing receptor sequences, as well as other material 
aspects concerning the nature and utility of these sequences, including how to 
make and how to use the subject matter claimed, will be further understood 
5 from the following, which constitutes the best mode currently contemplated for 
practicing the invention. 

Example 1 . IDENTIFICATION OF ALK-1 , ALK-2 , ALK-3 and ALK-6 

The cloning and characterization of ALK-1, -2, -3, and -6 receptors are 
described in detail in ten Dijke et al . (1993) Oncogene 8 :2879-2887; and 

10 (1994) Science 264:101-104. The general structures of these proteins is 

described in Fig. 1, and the sequence alignments between the ALK genes are 
shown in Figs 2 and 3. These molecules have similar domain structures: an N- 
terminal predicted hydrophobic signal sequence (von Heijne (1986) Nucl . Acids 
Res. 14 : 4683-4690) is followed by a relatively small extracellular cysteine- 
15 rich ligand binding domain, a single hydrophobic, transmembrane region (Kyte & 

Doolittle (1982) J. Mol. Biol. 157 , 105-132) and a C-terminal intracellular 

portion, which consists almost entirely of a kinase domain (Figure 3) . 

The extracellular domains of these receptors, defined essentially by 
residues 22-118 (SEQ. ID No. 1 ) for ALK-1; residues 16-123 (SEQ ID No. 3) for 
20 ALK-2; residues 24-152 (SEQ. ID No. 5) for ALK-3; and residues 23-122 {SEQ ID 

No. 7) for ALK-6, have cysteine-rich regions, but sequence similarity varies 

among the proteins. For example, ALK-3 and ALK-6 share a high degree of 
sequence similarity in their extracellular domains (46% identity) whereas ALK- 
2 shows less similarity with ALK 3 or ALK 6 (see Fig. 2.) 

25 The positions of many of the cysteine residues in these receptors can 

be aligned, indicating that the extracellular domains likely adopt a similar 
structural configuration. 

The intracellular domains of these receptors are characterized by a 
serine/ threonine kinase, defined essentially by residues 204-494 (SEQ. ID. No. 
1) for ALK-1; residues 210-510 (SEQ ID No. 3) for ALK-2; residues 236-527 (SEQ 
ID No. 5) for ALK-3; and residues 206-497 (SEQ ID No. 7) for ALK-6. The 
catalytic domains of kinases can be divided into 12 subdomains with stretches 
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of conserved amino-acid residues. The key motifs are found in 
serine/ threonine kinase receptors indicating that they are functional kinases. 
The consensus sequence for the binding of ATP (Gly-X-Gly-X-X-Gly in subdomain 
I followed by a Lys residue further downstream in subdomain II) is found in 
5 all the A1>KS . Moreover, ALK- 1 , ALK-2, AX.K- 3 and ALK-6 have the sequence 
motifs or similar motifs HRDLKSKN {Subdomain VIB) and GTKRYNAPE (Subdomain 
VIII) , that are found in most of the serine/ threonine kinase receptors and can 
be used to distinguish them from tyrosine kinase receptors. Two short inserts 
in the kinase domain (between subdomain VIA and VIB and between X .and XI are 
10 unique to members of this serine/ threonine kinase receptor family. In the 

intracellular domain, these regions, together with the juxtamembrane part and 
C-terminal tail, are the most divergent between family members. 

Type II serine/ threonine kinase receptors known in the art are 
described and referenced herein above . Example 2 . RECEPTOR EXPRESSION 
15 A. General Considerations 

Receptor DNA, or a synthetic form thereof, can be inserted, using 
conventional techniques well described in the art (see, for example, Maniatis 
(1989) Molecular Cloning A Laboratory Manual ), into any of a variety of 
expression vectors and transfected into an appropriate host cell to produce 
20 recombinant protein polypeptide chains, including both full length and 

truncated forms thereof. Shortened sequences, for example, can be used for 
the production of soluble receptor fragments . 

Useful host cells include E. coli , Saccharomyces cerevisiae , Pichia 
pastoris , the insect/baculovirus cell system, myeloma cells, and various other 
25 mammalian cells. The full length forms of the proteins of this invention 

preferably are expressed in mammalian cells, as disclosed herein. Soluble 
forms may be expressed from both mammalian or bacterial cell systems. The 
vector additionally may include various sequences to promote correct 
expression of the recombinant protein, including transcription promoter and 
30 termination sequences, enhancer sequences, preferred ribosome binding site 
sequences, preferred mRNA leader sequences, preferred protein processing 
sequences, preferred signal sequences for protein secretion, and the like. 
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The DNA sequence encoding the gene of interest also may be manipulated to 
remove potentially inhibiting sequences or to minimize unwanted secondary 
structure formation. The recombinant morphogen receptor also may be expressed 
as a fusion protein. After translation, the protein may be purified from the 
5 cells themselves or recovered from the culture medium. The DNA also may 
include sequences which aid in expression and/or purification of the 
recombinant protein. One useful sequence for example, is a hexa-His (His 6 ) 

sequence, which adds a histidine tail to allow affinity purification of the 
protein on an IMAC Cu2+ column (see below.) 

10 For example, the DNA encoding the extracellular domain may be inserted 

into a suitable expression vector for transformation into a prokaryote host 
such as E. coli or B, subtilis , to produce a soluble, morphogen binding 
fragment. The DNA may expressed directly or may be expressed as part of a 
fusion protein having a readily cleavable fusion junction. An exemplary 
15 protocol for prokaryote expression using MR-1 DNA is provided below. 

Recombinant protein is expressed in inclusion bodies and may be purified 
therefrom using the technology disclosed in U.S. Patent No. 5,013,653, for 
example . 

The DNA also may be expressed in a suitable mammalian host. Useful 
20 hosts include fibroblast 3T3 cells, (e.g., NIH 3T3, from CRL 1658) COS (simian 
kidney ATCC, CRL-1650) or CH0 (Chinese hamster ovary) cells (e.g., CHO-DXB11, 
from Lawrence Chasin, Proc. Nat*l. Acad. Sci. (1980) T7 (7) :4216-4222) , mink- 
lung epithelial cells (MVlLu) , human foreskin fibroblast cells, human 
glioblastoma cells, and teratocarcinoma cells. Other useful eukaryotic cell 
25 systems include yeast cells, the insect/baculovirus system or myeloma cells. 

To express an OP1 -specific cell surface receptor, the DNA is subcloned 
into an insertion site of a suitable, commercially available vector along with 
suitable promoter/ enhancer sequences and 3' termination sequences. Useful 
promoter/ enhancer sequence combinations include the CMV promoter (human 
30 cytomegalovirus (MIE) promoter) present, for example, on pCDM8, as well as the 
mammary tumor virus promoter (MMTV) boosted by the Rous sarcoma virus LTR 
enhancer sequence (e.g., from Clontech, Inc., Palo Alto). A useful induceable 
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promoter includes, for example, A Zn 2 * induceable promoter, such as the Zn 2 * 
metallothionein promoter (Wrana et al . (1992) Cell 71 ; 1003-1014 . ) Other 

induceable promoters are well known in the art and can be used with similar 
success. Expression also may be further enhanced using transactivating 
5 enhancer sequences. The plasmid also preferably contains an amplifiable 
marker, such as DHFR under suitable promoter control, e.g., SV40 early 
promoter (AT CC #37148) . Transfection, cell culturing, gene amplification and 
protein expression conditions are standard conditions, well known in the art, 
such as are described, for example in Ausubel et al . , ed. , Current Protocols 
10 in Molecular Biology , John Wiley & Sons, NY (1989) . Briefly, transfected 

cells are cultured in medium containing 5-10% dialyzed fetal calf serum (FCS) , 
and stably transfected high expression cell lines obtained by amplification 
and subcloning and evaluated by standard Western and Northern blot analysis. 
Southern blots also can be used to assess the state of integrated receptor 
15 sequences and the extent of their copy number amplification. 

The expressed protein then is purified using standard procedures. A 
currently preferred methodology uses an affinity column, such as a ligand 
affinity column or an antibody affinity column, the bound material then 
washed, and receptor molecules selectively eluted in a gradient of increasing 
20 ionic strength, changes in* pH or addition of mild denaturants . Alternatively, 
where a useful anchor sequence has been added to the DNA, such as a (His)g 

sequence, the column may be a standard affinity column such as Cu 2 "* IMAC 
column. Here, for example, the cell culture media containing the recombinant 

protein is passed over a Cu 2 * IMAC column {for example, prepared with 25 mM 
25 imidazole) . The bound protein then is washed with a compatible solution and 
eluted with EDTA . The anchor sequence can be removed by a standard chemical 
or enzymatic procedure. 

Mammalian cell expression is preferred where morphogen receptor 
expression on a cell surface is desired. For example, cell surface expression 
may be desired to test morphogen or morphogen analog binding specificity for a 
cell surface receptor under in vivo conditions. Cell surface expression also 
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may be most efficacious for medium flux cellular screen assays as described 
below. 

B. 1 Exemplary Mammalian Cell Culture 

The receptors tested in Examples 8 and 9 described below were expressed 
5 in (1) COS-1 cells; (2) mink lung epithelial cells (MvlLu) ; (3) AG1518 human 
foreskin fibroblasts; (4) MG- 63 human osteosarcoma cells; (5) PC12 rat 
pheochromocytoma cells {all obtained from American Type Culture Collection, 
Rockville, MD) ; (6) U-1240 MG human glioblastoma cells (Bengt Westermark, et 

al. (1988) Cancer Research £8:3910-3918); (7) Tera-2 teratocarcinoma cells 
10 (clone 13, Thompson et al . (1984) J Cell Sci 72:37-64); (8) MC3T3-E1 cells 

(Sudo et al. (1983) J. Cell Biol. 96:191-198, and (9) ROS 17/2.8 rat 
osteosarcoma cells (Majeska et al . (1985) Endocrinol ogy 116 :170-179. The ROS 

cells were cultured in Ham's F12 medium containing 14mM HEPES buffer, 2.5 mM 
L-glutamine, 1.1 mM CaCl 2 , 5% fetal bovine serum and antibiotics; MC3T3-E1 

15 cells were cultured in a-MEM with 10% fetal bovine serum and antibiotics, and 
Tera-2 cells were cultured in 5% C0 2 atmosphere at 37oC in a-MEM containing 
10% fetal bovine serum, 100 units/ml of penicillin and 50 mg/ml of 
streptomycin, using tissue culture dishes pretreated with 0.1% swine skin 
gelatin (Sigma) in phosphate-buffered saline. Unless otherwise specified, 

20 cells were cultured in DMEM containing 10% fetal bovine serum and antibiotics. 
B . 2 Transfection of cDNA 

The receptors tested in Example 8.1 (ALK 1-6, daf-4) were transfected 
as follows. Transient expression plasmids of ALK-1 to -6 and daf-4 were 
generated by subcloning into an expression vector (pSV7d, Truett et al . (1985) 

25 DNA 4:333-349) or into the pcDNA I expression vector (Invitrogen, San Diego) . 
For transient transfection, COS-1 cells were transfected with 10 mg each of 
plasmids by a calcium phosphate precipitation method using a mammalian 
transfection kit (Stratagene, La Jolla), following the manufacturer's 
protocol. One day after transfection, the cells were used for the affinity 
30 labeling and cross-linking experiments. 

Example 3 . ANTIBODY PRODUCTION 
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A. General Considerations 

Antibodies capable of specifically binding the receptor molecules, 
ligand molecules, or the ligand-receptor complex itself, useful as analogs and 
useful in immunoassays and in the immunopurification of morphogen receptors 
5 described may be obtained as described below. 

Where antibodies specific to the OPl-specific receptors are desired, 
but which do not interfere with ligand binding, the antigenic sequence 
preferably comprises the juxtamerabrane sequence. Where antibodies capable of 
competing for ligand binding are desired, the ligand binding domain may be 
10 used as the antigen source. Where antibodies to the complex are desired, the 
complex itself preferably is used as the antigenic sequence and candidate 
antibodies then tested for cross reactivity with uncomplexed ligand and 
receptors versus the ligand-receptor complex. Finally, bispecific antibodies 
may used to complex ligand to a cell surface receptor (Type I or Type II) 

15 . and/or to target an agent or ligand to cells or tissue expressing a Type I or 
Type II morphogen-specific receptor. Preferred bispecific antibody derived 
molecules are single chain binding sites described in US Pat No. 5,091,513 and 
5,132,405, the disclosures of which are incorporated hereinabove by reference. 

Antibodies useful as OP1 receptor-binding analogs may be obtained using 
20 the receptor ligand binding domain as the immunogen source and testing 
receptor-binding analogs for their ability to compete with OP1 in a 
competition binding assay. Similarly, where antibodies useful as OPl-specific 
receptor analogs are desired, OP1 is the immunogen source and the antibody 
candidate tested in a competition assay with receptor protein. 

25 Polyclonal antibodies specific for a morphogen receptor of interest may 

be prepared generally as described below. Each rabbit is given a primary 
immunization (e.g., 500mg) of antigen in 0.1% SDS mixed with 500 ml Complete 
Freund's Adjuvant. The antigen is injected intradermally at multiple sites on 
the back and flanks of the animal. The rabbit is boosted after a month with 
30 500mg of antigen in the same manner using incomplete Freund's Adjuvant. Test 

bleeds are taken from the ear vein seven days later. Two additional boosts 
and test bleeds are performed at monthly intervals until antibody against the 
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antigenic sequence is detected in the serum using a standard Western blot. 

Then, the rabbit is boosted monthly with 100 mg/ml of antigen and bled (15 ml 
per bleed) at days seven and ten after boosting. 

Similarly, monoclonal antibodies specific for a given morphogen 
5 receptor molecule of interest may be prepared as described below: A mouse is 

given two injections of the antigenic sequence. The protein preferably is 
recombinant ly produced. Where it is desired that the antibody recognize an 
epitope on the morphogen binding surface of a receptor an antigenic fragment 
derived from the extracellular domain preferably is provided. The first 
10 injection contains lOOmg of antigen in complete Freund's adjuvant and is given 
subcutaneously. The second injection contains 50 mg of antigen in incomplete 
adjuvant and is given intraperitoneally . The mouse then receives a total of 
230 mg of antigen in four intraperitoneal injections at various times over an 
eight month period. One week prior to fusion, the mouse is boosted 
15 intraperitoneally with antigen (e.g., 100 mg) and may be additionally boosted 
with an antigen-specific peptide conjugated to bovine serum albumin with a 
suitable crosslinking agent. This boost can be repeated five days (IP) , four 
days (IP), three days (IP) and one day (IV) prior to fusion. The mouse spleen 
cells then are fused to commercially available myeloma cells at a ratio of 1:1 
20 using PEG 1500 (Boehringer Mannheim, Germany) , and the fused cells plated and 
screened for ALK-specific antibodies, e.g., using ALK-2, ALK-3 or ALK-6 as 
antigen. The cell fusion and monoclonal screening steps readily are performed 
according to standard procedures well described in standard texts widely 
available in the art. (See, for example. Guide to Protein Purification Murray 
25 P. Deutscher, ed.. Academic Press, San Diego, 1990. 

B . Exemplary ALK-Specific Antisera 

Antibodies used in the assay of Example 8 were obtained as follows . 
Rabbit antisera against ALK-1 to -6 were made against synthetic peptides 
corresponding to the divergent, intracellular juxtamembrane parts. (ALK-1: 

30 residues 119-141; ALK-2: residues 151-172; ALK-3 residues 181-202; ALK-6: 

residues 151-168.) Peptides were synthesized with an Applied Biosystems 430 A 
Peptide Synthesizer using t-butoxycarbonyl chemistry, and purified by reverse 
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phase HPLC. The synthetic peptides were coupled to keyhole limpet hemocyanin 
(Calbiochem-Behring) using glutaraldehyde, as decribed by Gullick et al . 

(1985) EMBO J 4: 2869-2877. The coupled peptides then were mixed with 
Freund's adjuvant and used to immunize rabbits using standard methodologies. 

5 Example 4 . OP1-RECEPTOR BINDING ASSAYS 

Ligand binding specificity is determined by evaluating the ability of a 
receptor molecule to bind a specific ligand, and the ability of that ligand to 
compete against itself and other molecules which bind the receptor. Useful 
ligands will have a binding affinity for a soluble morphogen receptor 
10 extracellular domain such that dissociation constant (Kd) is less than about 

10~^M, preferably less than 5x 10" 7 M. Where stronger binding interaction is 

desired, preferred affinities are defined by a Kd of 10“®-10“^M. OPl-related 
proteins are expected to be able to bind with specificity to multiple 
different receptor molecules, although likely with differing affinities. 

15 Ligand binding specificity can be assayed as follows, essentially 

following standard protocols well described in the art and disclosed, for 
example, in Legerski et al . (1992) Biochem. Biophys. Res. Comm 183 :672-679 and 

Frakar et al., (1978) Biochem. Biophys. Res. Comm. 80 :849-857. In the ligand 
binding assays, a ligand having a known, quantifiable affinity for a morphogen 
20 receptor molecule of interest is labelled, typically by radioiodination 
( 125 I), e.g., by chromogenic or fluorogenic labeling, or by metabolic 

labelling, e.g., 35 S, and aliquots of cells expressing the receptor on their 
surface are incubated with the labelled ligand, in the presence of various 
concentrations of unlabelled potential competitor ligand. In the assays 
25 described in Examples 8 and 9, below, this competitor typically is the 

candidate morphogen analog or an aliquot from a broth or extract anticipated 
to contain a candidate morphogen analog. 

Alternatively, a crosslinking agent may be used to covalently link the 
ligand to the bound receptor, and the cros si inked complex then 
30 immunoprecipitated with an antibody specific to the ligand, receptor, or 
complex. (See, Example 8.) 
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A standard, exemplary protocol for determining binding affinity is 
provided below. Briefly, cells expressing a receptor on their cell surface 
are plated into 35 mM dishes and incubated for 48 hours in DMEM (Dulbecco's 
modified Eagle medium) plus 10% fetal calf serum. Purified morphogen, here, 

5 e.g., OP-1, or an OPl-analog is iodinated with Na^^I by chloramine T 

oxidation, preferably having a specific activity of about 50-100 raCi/mg, 
essentially following the protocol of Frolik et al . (1984) J. Biol. Chem. 

595 : 10995-11000 . Labelled morphogen then is purified using standard 
procedures, e.g., chromatographically . Plated cells then are washed twice 
10 with physiologically buffered saline in the presence of 0.1% BSA, and 

incubated at 22oC in the presence of BSA, buffer and labelled morphogen (1 ng) 
and various concentrations (e.g., 0-10mg/ml) of unlabelled competitor, e.g., 
unlabelled morphogen or candidate ligand analogs. Following binding, cells 
are washed three times with cold buffer, solubilized in 0.5 ml of 0.5 N NaOH, 
15 removed from the dish, and radioactivity determined by gamma or scintillation 
counter. Data then are expressed as percent inhibition, where 100% inhibition 
of specific binding is the difference between binding in the absence of 
competitor and binding in the presence of a 100-fold molar excess of 
unlabelled morphogen. Binding parameters preferably are determined using a 
20 computer program such as LIGAND (Munsun et al . (1980) Anal. Biochem. 107 :220- 

259.) 

Where the receptor cell surface binding domain is to be provided as a 
soluble protein, the assay can be performed in solution, most readily as an 
immunoprecipitation assay. In currently preferred assays the morphogen 
25 molecule is labelled and incubated with unlabelled receptor and candidate 
morphogen analogs. Receptor-specif ic antibody then is provided to the 
solution to precipitate the receptor-morphogen complex and the amount of 
labelled morphogen in the precipitated complex determined using standard 
detection means. 

30 Where the receptor or ligand is to be used in an affinity isolation 

protocol, the molecule preferably is immobilized on a surface, preferably a 
matrix surface over which sample fluid will flow, allowing the ligand of 
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interest to bind, at letting nonbinding components pass through as effluent. 
The complex then can be removed intact or the ligand selectively removed with 
a desired eluant. 

4.1 Screening Assay Considerations 

5 In the analog screening assays described in Example 9 below, the 

preferred protocol for assaying ligand-receptor binding is a standard 
competition or radioimmunoassay (RIA) . Here the OP1 is labelled and the 
relative binding affinity of a candidate OP1 analog ligand in a sample is 
measured by quantitating the ability of the candidate (unlabelled ligand 
10 analog) to inhibit binding of the labelled ligand (competitor morphogen) by 
the receptor. In performing the assay, fixed concentrations of receptor and 
labelled morphogen are incubated in the absence and presence of unknown 
samples containing candidate ligands. Sensitivity can be increased by 
preincubating the receptor with candidate ligand before adding the labelled 
15 morphogen. After the labelled competitor has been added, sufficient time is 
allowed for adequate competitor binding, and then free and bound labelled 
morphogen are separated, and one or the other is measured. Useful morphogen 
labels include radioactive labels, chromogenic or fluorogenic labels, and 
conjugated enzymes having high turnover numbers, such as horseradish 
20 peroxidase, alkaline phosphatase, or b-galactosidase, used in combination with 
chemiluminescent or fluorogenic substrates. In the same manner, OPl-specific 
receptor analogs can be assayed for their affinity for OP1 in competition 
assays with labelled OP1 specific receptors. 

Assays for evaluating a candidate OP1 receptor-binding analog's ability 
25 to mimic OP-1 in signal- transduction across a membrane are exemplified in 
detail in Example 9.2, below. Briefly, the assay involves use of a cell 
(1) known to express an OP-l-specif ic receptor; or (2) which can be modified 
to express an OP-l-specific receptor, and/or (3) which can induce an OP-1- 
mediated cellular response. In the assay, the ability of a candidate analog 
30 to induce an OP-l-specific cellular response is monitored. Numerous OP-1 

responsive cells and OP-l-mediated inducible cellular and biochemical markers 
are known and described in the art. Alternatively, and as exemplified below. 
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an OP-1 inducible reporter gene system can be constructed and used to 
advantage in the assay. 

4.2 Diagnostic Assay Considerations 

The ability to detect morphogens in solution provides a valuable tool 
5 for diagnostic assays, allowing one to monitor the level of morphogen free in 
the body, e.g., in serum and other body fluids. For example, OP-1 has been 
detected in a number of different body fluids, including serum and spinal 
fluid, including cerebro- spinal fluid, saliva, milk and other breast exudates. 
(See, for example, PCT US92/07432, PCT US93/07231, W094/06449) . 

10 As one example, OP-1 is an intimate participant in normal bone growth 

and resorption. Thus, soluble OP-1 is expected to be detected at higher 
concentrations in individuals experiencing high bone formation, such as 
children, and at substantially lower levels in individuals with abnormally low 
rates of bone formation, such as patients with osteoporosis, aplastic bone 
15 disease, or osteopenia. Monitoring the level of OP-1 in serum thus provides a 
means for evaluating the status of bone tissue and bone homeostasis in an 
individual, as well as a means for monitoring the efficacy of a treatment to 
regenerate damaged or lost bone tissue. Alternatively, the level of OP-1 in 
bone tissue can be assessed in a bone tissue biopsy. 

20 Similarly, OP-1 and other morphogens have been identified in brain 

tissue. In particular, OP-1 is expressed and/or localized in developing and 
adult rat brain and spinal cord tissue, in the hippocampus, substantia nigra 
and the adendema glial cells, as well as associated with astrocytes and the 
■ extracellular matrix surrounding neuronal cell bodies. (See, PCTUS93/07331) . 
25 Thus, monitoring the level of OP-1 in spinal fluid or associated with a nerve 
tissue biopsy can provide means for evaluating the status of nerve tissue in 
an individual, as well as means for monitoring the efficacy of a nerve 
regeration or repair therapy. 

For serum assays, the serum preferably first is partially purified to 
30 remove some of the excess, contaminating serum proteins, such as serum 

albumin. Preferably the serum, is extracted by precipitation in ammonium 
sulfate (e.g., 45%) such that the complex is precipitated. Further 
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purification can be achieved using purification strategies that take advantage 
of the differential solubility of soluble morphogen complex or mature 
morphogens relative to that of the other proteins present in serum. Further 
purification also can be achieved by chromatographic techniques well known in 
5 the art. The sample fluid then can be assayed for OP1 using the OPl-specific 
receptor (s) and binding assays as described herein. 

For a tissue biopsy, cells can be collected and stained with a labelled 
OP-1 -specific antibody or receptor molecule. Alternatively, the OP-1 protein 
selectively can be extracted and quantitated as described above. 

10 Morphogens useful in the binding/screening assays contemplated herein 

include the soluble forms of the protein, e.g., the mature dimeric species 
complexed with one or two copies of the pro domain, the mature dimeric species 
alone, and truncated forms comprising essentially just the C-terminal active 
domain. 

15 Example 5 . TRANSMEMBRANE SIGNAL INDUCTION ASSAYS/0P1 MIMETICS 

The kinase activity of the intracellular domains of the OPl-specific 
receptors can be tested in an autophophorylation assay as described by Mathews 
et al. { PCT/US92/03825 , published November '26, 1992) . Briefly, the DNA 

fragment encoding at least the intracellular kinase domain of an OPl-specific 

20 receptor is subcloned into pGEX-2T (Smith et al.{1988) Gene 67:31-40) to 

create a fusion protein between the putative kinase domain and glutathione s- 
transferase (GST) . The plasmid is introduced into E. coli and the expressed 
fusion protein purified using glutathione affinity chromatography. About 100- 
200 ng of fusion protein or purified GST then are incubated with 25 mCi 

25 (gp 32 P) ATP in 50mM tris, lOmM MgCl 2 buffer for 30 minutes at 37° C. Products 

then are analyzed by gel electrophoresis and autoradiography. The fusion 
protein, but not GST alone, becomes phophorylated, indicating that the kinase 
domain is functional . Phosphoamino acid analysis then can be performed to 
determine the predominant amino acid being phosphorylat'ed . Similar assays can 

30 be performed using similar fusion constructs expressed in mammalian cells.. 

Various signaling transduction assays are provided in Example 9, below. 
An assay also can be developed for testing kinase activity transduction upon 
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ligand binding using a ligand-induced kinase activity assay known in the art. 
Here, the ability of OP-1 analog to induce phosphorylation upon binding to the 
receptor is tested. 

See, for example, various assays for measuring ligand-induced kinase 
5 activity described by Accili et al. (1991) J. Biol.Chem. 266 : 434-439 and 

Nakamura et al. (1992) J.Biol. Chem. 267 : 18924-18928, . For example, ligand- 
induced kinase activity (e.g., receptor autophosphorylation) can be measured 
in vitro by incubating purified receptor in the presence and absence of ligand 
(here, OP1 or OP1 analog, e.g., 10” 7 M) under conditions sufficient to allow 
10 binding of the ligand to the receptor, followed by exposure to 3 ^P-ATP (e.g., 
lOOmCi in the presence of lOmM Tris-HCl (pH 7.6), lOmM MgC^, lOmM MnCl 2 , ImM 
dithiothreitol , 0.15M NaCl 2 , 0 . 1% Triton X-100 and 3% glycerol) and the amount 

of phosphorylation measured, e.g., by SDS polyacrylamide gel electrophoresis 
and autoradiography following immmunoprecipitation with antiphosphoserine , 

15 antiphosphothreonine or antiphosphotyrosine antibody (e.g., commercially 

available or made using standard antibody methodologies.) While a low level 
of autophosphorylation may be detected in the absence of ligand, incubation 
with ligand is anticipated to significantly increase (e.g., 5-20 fold 
increase) the amount of phosphorylation detected. 

20 In another assay for detecting ligand-induced receptor 

autophosphorylation, involving intact cells, receptor DNA is transfected into 
a suitable host cell, e.g., a fibroblast, which then is grown under standard 
conditions to create a confluent monolayer of cells expressing the receptor on 
their cell surface. On the day of the experiment, cells are incubated with or 

25 without ligand (e.g., OPl or OP1 analog, e.g., 10" 7 M) at 37°C, and then 

quickly washed with a "stopping solution" containing ATP (e.g., 0.1M NaF, 4 
mM EOT A, lOmM ATP, 10 mM sodium orthovanadate, 4mM sodium pyrophosphate) . The 
cells then are frozen in a dry ice/ethanol bath, solubilized and the receptors 
immunoprecipitated, e.g., with an antireceptor antibody, as described herein. 
The immune complexes then are segregated, washed, separated by gel 
electrophoresis using standard procedures and transferred to a membrane for 
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Western blot analysis using standard procedures. Phosphorylation of the 
receptor then can be visualized by immunodetection with a suitable antibody 
(e.g., antiphosphoserine, antiphosphothreonine or ant iphospho tyros ine ) , as 
described above. The bound antibody (e.g., bound antiphosphoserine, 

5 antiphosphothreonine or antiphosphotyrosine) then can be detected with *25 j 

labelled protein A, followed by autoradiography. The amount of phosphorylated 
receptor detected is anticipated to be significantly greater (5-20 fold 
increase} in receptors incubated with ligand than receptors exposed to ATP in 
the absence of ligand. 

10 Ligand-induced receptor phosphorylation of exogenous substrates 

similarly can be assayed essentially using the methodology described herein. 
Here, a suitable substrate (e.g., a synthetic polypeptide containing serine, 
threonine or tyrosine amino acids) is provided to the receptor following 
ligand exposure and prior to incubation with ATP. The substrate subsequently 
15 can be segregated by immunoprecipitation with an antibody specific for the 
substrate, and phosphorylation detected as described above. As for 
autophosphorylation, the amount of phosphorylated substrate detected following 
ligand incubation is anticipated to be greater than that detected for 
substrates exposed to receptors in the absence of ligand. 

20 Alternatively, a reporter gene construct can be created to assay 

transmembrane signal induction. Here, the expression control elements for an 
OP-1' inducible protein marker is fused to the open reading frame sequence for 
any reporter gene, and induction of the reporter gene expression then assayed. 
Useful reporter genes include the luciferase gene or GAL 4 as well as other, 

25 easily characterizable markers. 

Example 6. CHIMERIC RECEPTOR MOLECULES 

Chimeric receptor molecules, e.g., comprising an ALK or ALK analog 
extracellular and transmembrane region and, for example, part or all of an 
intracellular domain from another, different receptor or an intracellular 
domain from a different cell surface molecule, may be constructed using 
standard recombinant DMA technology and/or an automated DNA synthesizer to 
construct the desired sequence. As will be appreciated by persons skilled in 
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the art, useful junctions include sequences within the transmembrane region 
and/or sequences at the junction of either the intracellular or the 
extracellular domains. Also envisioned are chimers where the extracellular 
domain or the intracellular domains themselves are chimeric sequences. 

5 Chimeric sequences are envisioned to be particularly useful in 

screening assays to determine candidate binding ligands (e.g., OP1 analogs, 
see below) , where the non-receptor intracellular domain provides a suitable 
second messenger response system that is easy to detect. Potentially useful 
other second messenger response systems include those which, when activated, 

10 induce phosphoinositide hydrolysis, adenylate cyclase, guanylate cyclase or 
ion channels . 

Chimeric receptor molecules have particular utility in gene therapy 
protocols. For example, a population of cells expressing a chimeric morphogen 
receptor molecule on their surface and competent for expressing a desired 
15 phenotype can be implanted in a mammal at a particular tissue locus. By 
careful choice of the ligand binding domain used on these receptors a 
physician can administer to the individual a morphogen agonist capable of: 

(1) binding to the chimeric receptor alone and (2) stimulating the 
proliferation and/or differentiation of the implanted cells without affecting 
20 endogenous cell populations. 

Example 7. CONSIDERATIONS FOR IDENTIFYING OTHER OP1 SPECIFIC RECEPTORS IN 
NUCLEIC ACID LIBRARIES 

Identification of ALK Type I receptors that can bind OP-1 allows one 
to identify other morphogen receptor sequences in different species as well as 
25 in different tissues. The OPl-binding ALK sequences themselves can be used as 
a probe or the sequence may be modified to account for other potential codon 
usage {e.g., human codon bias.) Currently preferred probe sequences are 
those which encode the receptor's extracellular domain ^ 

Probes based on the nucleic acid sequence of Seq. ID Nos.l, 3, 5 or 7 
30 can be synthesized on commercially available DNA synthesizers, e.g. Applied 
Biosystems model 381A, using standard techniques, e.g. Gait, Oligonucleotide 
Synthesis: A Practical Approach , ( IRL Press, Washington D.C., 1984). It is 
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preferable that the probes are at least 8-50 bases long, more preferably 18-30 
bases long. Probes can be labeled in a variety of ways standard in the art, 
e.g. using radioactive, enzymatic or colormetric labels as described, for 
example, by Berent et al, (May/June 1985) Biotechniques ; 208-220; and 
5 Jablonski et al, (1986) Nucleic Acids Research 14: 6115-6128. 

Preferably, low stringency conditions are employed when screening a 
library for morphogen receptor sequences using a probe derived from OP1- 
binding receptor. Preferred ALK-specific probes are those corresponding to 
bases encoding the extracellular domain ( " ECD " ) , or encoding a unique 
10 (nonhomologous) sequence within the cytoplasmic domain. Useful probes may be 
designed from bases encoding the juxtamembrane region, for example. The 
probe may be further modified to use a preferred species codon bias. 
Alternatively, probes derived from the serine /threonine kinase domain can be 
used to identify new members of the receptor kinase family which can be 
15 screened for OP1 binding using the methods described in Example 8. 

For example, for a probe of about 20-40 bases a typical 
prehybridization, hybridization, and wash protocol is as follows: 

(1) prehybridization: incubate nitrocellulose filters containing the denatured 
target DMA for 3-4 hours at 55oC in 5X Denhardt's solution, 6X SSC (20X SSC 

20 consists of 175g NaCl, 88.2 g sodium citrate in 800 ml H 2 0 adjusted to pH. 7.0 

with 10N NaOH) , 0.1% SDS, and 100 mg/ml denatured salmon sperm DNA, 

(2) hybridization: incubate filters in prehybridization solution plus probe at 
42oC for 14-48 hours, (3) wash; three 15 minutes washes in 6X SSC and 0.1% SDS 
at room temperature, followed by a final 1-1.5 minute wash in 6x SSC and 0.1% 

25 SDS at 55oC. Other equivalent procedures, e.g. employing organic solvents 
such as formamide, are well known in the art. 

Alternatively, morphogen receptor-specific DNA can be amplified using a 
standard PCR (polymerase chain reaction) methodology such as the one disclosed 
herein, to amplify approximately 500 base pair fragments. As for the 
30 hybridization screening probes described above, the primer sequences 

preferably are derived from conserved sequences in the serine /threonine kinase 
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domain. The primers disclosed herein, in Seq. ID Nos. 12-15 are envisioned to 
be particularly useful, particularly in combination.. 

Examples of useful PCR amplifications, including the use of the primers 
recited herein, are disclosed in ten Dijke, et al . (1993) Oncogene 8:2879-2887 
5 and (1994) Science 264 : 101-104 . , and which also describe the isolation 
protocols for ALK-1, ALK-2 , ALK-3 and ALK-6 . 

7.1 TISSUE DISTRIBUTION OF MORPHOGEN RECEPTORS 

Determining the tissue distribution of OPl-specific receptors can be 
used to identify tissue and cell sources which express these receptors, to 
10 identify new, related OPl-specific receptor molecules, as well as to identify 
target tissues for OPl-receptor interactions under naturally occurring 
conditions. The OP-1 specific receptor molecules (or their mRNA transcripts) 
readily are identified in different tissues using standard methodologies and 
minor modifications thereof in tissues where expression may be low. For 
15 example, protein distribution can be determined using standard VJestern blot 
analysis or immunohistological detection techniques, and antibodies specific 
to the morphogen receptor molecules of interest. Similarly, the distribution 
of morphogen receptor transcripts can be determined using standard Northern 
hybridization protocols and transcript-specific probes or by in . situ 
20 hybridization. 

Any probe capable of hybridizing specifically to a transcript, and 
distinguishing the transcript of interest from other related transcripts can 
be used. Because the morphogen receptors described herein likely share high 
sequence homology in their intracellular domains, the tissue distribution of a 
25 specific morphogen receptor transcript may best be determined using a probe 
specific for the extracellular domain of the molecule. For example, a 
particularly useful ALK-specific probe sequence is one derived from a unique 
portion of the 5' coding sequence, the sequence corresponding to the 
juxtamembrane region, or the 5' or 3' noncoding sequences. The chosen 
30 fragment then is labelled using standard means well known and described in the 
art and herein . 
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Using these receptor-specific probes, which can be synthetically 
engineered or obtained from cloned sequences, receptor transcripts can be 
identified and localized in various tissues of various organisms, using 
standard methodologies well known to those having ordinary skill in the art. 

5 A detailed description of a suitable hybridization protocol is described in 
Ozkaynak, et al . , (1991) Biochem. Biophys. Res. Cornmn . 179 : 116-123 , and 

Ozkaynak, et al . (1992) J Biol. Chem . 267 ; 25220-25227 . Briefly, total RNA is 
prepared from various tissues (e.g., murine embryo and developing and adult 
liver, kidney, testis, heart, brain, thymus, stomach) by a standard 
10 methodologies such as by the method of Chomczynski et al . ((1987) Anal , 

Biochem 162:156-159} and described below. Poly (A)+ RNA is prepared by using 
oligo (dT) -cellulose chromatography (e.g., Type 7, from Pharmacia LKB 
Biotechnology, Inc.). Poly (A)+ RNA (generally 15 mg) from each tissue is 
fractionated on a 1% agarose /formaldehyde gel and transferred onto a Nytran 
15 membrane (Schleicher & Schuell). Following the transfer, the membrane is 

baked at 80oC and the RNA is cross-linked under UV light (generally 30 seconds 

at 1 mW/cm^) . Prior to hybridization, the appropriate probe is denatured by 
heating. The hybridization is carried out in a lucite cylinder rotating in a 
roller bottle apparatus at approximately 1 rev/min for approximately 15 hours 
20 at 37oC using a hybridization mix of 40% formamide, 5 x Denhardts , 5 x SSPE, 

and 0.1% SDS. Following hybridization, the non-specific counts are washed off 
the filters in 0.1 x SSPE, 0.1% SDS at 50oC. 

Example 8 . Demonstration That ALK-2 , ALK-3 and ALK-6 
are OPl-Binding Receptors 

25 The tissue morphogenic proteins OPl and BMP 4 were tested for specific 

binding interaction with the ALK receptors in receptor- transfected cells 
(where the receptor is over-expressed) , and in nontransfected cells. It 
previously was known that ALK- 5 interacted specifically with TGFfil and ALK-2 
and ALK-4 interacted specifically with activin. In the experiment, complexes 
30 were crosslinked and immuno-precipitated with an ALK-specific antibody as 

described below. To date, no binding with ALK-1 under the conditions of this 
protocol have been detected. 
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Binding and affinity cross-linking using disuccinimidyl suberate 
(Pierce Chemical Co.) were performed using standard methods (e.g., Franzen et 
al. (1993) Cell 75:681-692 and Ichijo et al . (1990) Exp. Cell Res. 187:10995- 
11000.) A typical protocol is described below. Modifications from this 
5 protocol for individual experiments were standard changes anticipated to 
produce the same result as for the recited procedure. Briefly, cells in 
multi-well plates were washed with binding buffer (e.g., phosphate buffered 
saline containing 0.9 mM CaCl 2 * 0.49raM MgCl 2 and 1 mg/ml bovine serum albumin 
(BSA) ) , incubated on ice in the same buffer with labelled ligand, in the 
10 presence and absence of excess unlabelled ligand for sufficient time for the 
reaction to equilibrate (e.g., 3 hours.) Cells were washed and. the 
crosslinking was done in the binding buffer without BSA together with 0.28 roM 
disuccinimidyl suberate for 15 min on ice. Cells were harvested by addition of 
1 ml of detachment buffer (10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 10% glycerol, 

15 0.3 mM PMSF.) Cells then were pelleted by centrifugation, then resuspended in 

50 ml of solubilization buffer (125 mM NaCl, lOmM Tris-HCl, pH.7.4, ImM EDTA, 
1% Triton X-100, 0.3 mM PMSF, 1% Trasylol) and incubated for 40 minutes on 
ice. Cells were centrifuged again and supernatants subjected to analysis by 
standard SDS-gel electrophoresis using 4%-15% polyacrylamide gels, followed by 
20 autoradiography. 

Cell lysates obtained following affinity cross-linking via the general 
protocol described above were immunoprecipitated using antisera against ALKs 
(e.g., raised against the ALK juxtamembrane region), or directly analyzed by 
SDS-gel electrophoresis using gradient gels consisting of 5-12% or 5-10% 

25 polyacrylamide. The gels were fixed and dried, and then subjected to 
autoradiography or analysis using phosphor Imager (Molecular Dynamics) . 

8 . 1 Binding of OP-1 and/or BMP-4 to ALKs in Transfected Cells. 

COS-1 cells transfected with ALK cDNA were tested for the binding of 
125 j-op-i and *2-I-BMP-4 , the presence or absence of co-transfected Type II 
30 receptor DNA : daf-4 cDNA or ActRII (Estevez et al . (1993) Nature 365:644-649 

and Attisano et al. (1992) Cell £8: 97-108, disclosing the DNA sequence for 
these Type II receptors and the disclosure of which is incorporated herein by 
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reference.) Since the cross-linked complexes were sometimes difficult to 
visualize because of high background, samples were immunoprecipitated by 
antisera against each ALK. The results are presented in Table I below. In 
the Table, "N/T n means "not tested". Binding was specific as determined by 
5 standard competition assays. The values represented by , " + '• , 

"+++", and are all qualitative descriptors of the relative amount of 

radioactivity measured when the crosslinked molecules were gel electrophoresed 
and subjected to autoradiography. . More radioactivity measured indicates a 
stronger binding interaction detected. In the Table the strength of binding 
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In the absence of daf-4, OP-1 bound to ALK-6 , whereas BMP- 4 bound to 
ALK- 3 and ALK-6. Weaker binding of OP-1 to ALK-2 was also observed. Other 
20 ALKs did not bind OP-1 or BMP-4 in the absence of Daf-4. When ALK cDNAs were 
co- transfected with the daf-4 cDNA, increased binding of OP-1 to ALK-2 and 
ALK-6 was seen. In addition, ALK-3 also was found to bind OP-1 in the co- 
transfected cells. Similarly, increased binding of BMP-4 to ALK-3 and ALK-6 
could be observed. Co- transfection of two different types of ALKs did not 
25 further increase the binding of OP-1 or BMP-4. In cells co-transfected with 
the DNA for ActRII and ALK-2, ALK-3 or ALK-6, OPl-receptor binding was 
enhanced. 

The sizes of the cross-linked complexes were slightly higher for ALK-3 
than for ALK-2 and ALK-6, consistent with its slightly larger size. Complexes 
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of about 95 kDa as well as multiple components of 140-250 kDa were also co- 
immunoprecipitated with certain of the ALKs. 

In standard competition assays performed with the Type I receptors in 
the presence and absence of the Type II receptors, the binding of OP-1 and 
5 BMP-4 could be competed with excess amounts of unlabeled OP-1, verifying the 
binding specificity of these interactions where they occurred. 

These results demonstrate that ALK-2 , ALK-3 and ALK-6 can serve as 
Type I receptors for OP-1. Notably, ligand binding apparently can be enhanced 
in the presence of Type II receptors. Moreover, OP-1 is able to interact with 
10 both a "bone morphogen" Type II receptor {daf 4) and an "activin" Type II 
receptor (ActRII), whereas, for example, activin only interacts with the 
ActRII Type II receptor. The data indicate that OP1 has a broader spectrum of 
receptor {Type I and Type II) binding affinities than do other tissue 
morphogenic proteins, or other members of the TGF-fc family. It is anticipated 
15 that OP1 will have specific binding interactions with other "activin-binding** 
or "bone morphogen -binding” Type II receptors. 

The ability of OP-1 to bind to Type I, Type II receptors having binding 
specificity for activin or BMP 4 but not TGF-B, indicates that OP-1 and OP-1 
analogs will be useful as competitors of activin or BMP 4 binding to cell 
20 surface receptors. In particular, OP-1 and OP-1 analogs will be useful for 
competing with activin-ALK-2 binding and/or act ivin-ALK-2 /ActRII (or other 
Type II) receptor binding; and for competing with BMP 4 (or BMP2) -ALK-6 
binding, and/or BMP2/4-ALK-6/daf 4 (or other Type II) binding. The OP-1 
competitors may act as antagonists (e.g., binding competitors unable to induce 
25 the signal transduction cascade upon binding) or as agonists (e.g., able both 
to bind and induce the signal transduction cascade) . 

8 . 2 Identification of OPl-5pecific Receptors in Nontransf ected Cells 

ALK-5 has been shown to- bind TGF-B1, and ALKs 2, 4 bind activin A with 
high affinities in nontransfected cells ( ten Dijke , Oncogene, ( 1993_; Science , 
30 _[1994) referenced herein above.) In the present experiment, the binding 

affinity of OP1 and/or BMP4 to receptors in nontransfected cells was 
demonstrated as follows. The results corroborate the transfected cell data, 
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verifying that OP1 interacts specifically with ALK-2, ALK-3 and ALK-6, but not 
ALK-4 or ALK-5 . 

MC3T3-E1 osteoblasts are well characterized cells known to respond to 
OP-1 and BMP-4 in the induction of alkaline phosphatase activity (Paralkar 

5 (1991) PNAS B: 339*7-3401.) These cells were affinity labeled using ^ 2 ^I-OP-l 

as described herein above, and cross-linked complexes of about 75 kDa were 
seen, which were immunoprecipitated only with the ALK-2 antiserum. Tera-2 
teratocarcinoma cells and MvlLu cells responded to OP-1 as measured by 
production of plasminogen activator inhibitor- 1 (PAI-1) . Similar to MC3T3-E1 

10 cells, cross-linked complexes using 125 I-OP-l in Tera-2 teratocarcinoma were 
immunoprecipitated only by the ALK-2 antiserum. 

On the other hand, cross-linked complexes using 12 ^I-OP-l to mvILu 
cells were immunoprecipitated by ALK-2 as well as ALK-3 and ALK-6 antisera. 
MvlLu cells are known to express ALK-4 and ALK-5 (Ebner (1993) Science 

15 260 : 1344-1348) , but cross-linked complexes with 125 I-OP-l were not 

precipitated by antisera against these receptors. Similarly, cross-linked 
complexes in U-1240 MG glioblastoma cells were immunoprecipitated by ALK-2 and 
ALK-6 antisera, and weakly by ALK-3 antiserum. In contrast, cross-linking of 

j.^I-OP- 1 to AG1518 human foreskin fibroblasts yielded weak immunoprecipitated 
20 components only by ALK-3 antiserum. Type II receptor-like components of about 
95 kDa as well as high molecular^ weight complexes of 140-250 kDa co- 
immunoprecipitated with certain ALKs in the Tera-2 cells, MvlLu cells and U- 
1240 MG cells. 

Receptors for BMP-4 have also been investigated using nontransf ected 
25 cells. Cross-linked complexes using 125 I-BMP-4 to MC3T3-E1 cells and AG1518 
human foreskin fibroblasts were immunoprecipitated only by ALK-3. On the 

other hand, cross-linking of 125 I-BMP-4 to Tera-2 cells did not yield any 
immunoprecipitated components by antisera against ALKs. 

125J-OP-1 and/or 125 I-BMP-4 also were demonstrated by affinity cross- 
30 linking to interact specifically with receptors in other BMP- responsive cells, 
e.g., MG- 6 3 osteosarcoma cells and PC12 pheochromocytoma cells. A summary of 
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the binding of ALKs to OP-1 or BMP-4 in different cell types is shown in Table 
II, below. In the Table, "N/T" means not tested, and receptors presented in 
brackets indicate comparatively lower quantities of radioactive complexes 
detected. 



TABLE II 



Cell lines 



Mouse osteoblasts 
(MC3T3-E1 

Mink lung epithelial 
cells (MvlLu) 

Human glioblastoma 

Human teratocarcinoma 
(Tera-2 ) 

Human foreskin 
fibroblasts (AG1518) 

Rat osteosarcoma 
(R0S17/2 . 8 ) 



Binding of OP-1 
ALK-2 

ALK-2 , -3, -6 
ALK2 , [-3], -6 

ALK2 

[ ALK3 ] 

ALK2, [-3] 



Binding of BMP-4 
ALK-3 

N/T 

N/T 

ALK3 

N/T. 



Example 9 . OP1, OP1 -SPECIFIC RECEPTOR ANALOG SCREENING ASSAYS 

The present invention is useful to determine whether a ligand, such as 
a known or putative drug, is capable of' binding to and/or activating an OP1- 
specific cell surface receptor as described herein. Ligands capable of 
10 specific binding interaction with a given OPl-specific receptor (e.g., ALK-2, 
ALK-3, ALK-6) are referred to herein as OP1 analogs and can be used for 
therapeutic and diagnostic applications. Some analogs will have the ability 
to stimulate morphogenetic activity in the cell, mimicking both the receptor 
binding and signal transducing activity of OP1 . These are referred to OP1 
15 agonists or mimetics . Others will have strong binding affinity but will not 

stimulate morphogenesis, these are OP1 antagonists. The analogs can be amino 
acid-based, or they can be composed of non-proteinaceous chemical structures. 

The methods and kits described below similarly can be used to identify 
OPl-specific receptor analogs, capable of mimicking the binding affinity of 
20 ALK-2, ALK-3 or ALK-6 for OP1 . The analogs can be provided to a mammal to 
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interact with serum-soluble OP1, effectively sequestering the protein and 
modulating its availability for cell surface interaction. 

Transfection of an isolated clone encoding a morphogen receptor into 
the cell systems described above provides an assay system for the ability of 
5 ligands to bind to and/or to activate the receptor encoded by the isolated DNA 
molecule. Transfection systems, such as those described above, are useful as 
living cell cultures for competitive binding assays between known or candidate 
drugs and ligands which bind to the receptor and compete with the binding of 
known morphogens, which are labeled by radioactive, enzymatic, spectroscopic 
10 or other reagents. Membrane preparations containing the receptor and isolated 
from transfected cells are also useful in these competitive binding assays. 
Alternatively, and currently preferred, purified receptor molecules or their 
ligand binding extracellular domains can be plated onto a microtiter well 
surface, in a modification of a sandwich assay, e.g., as a competition assay, 
15 such as an RIA, described above. Finally, as described above, solution 

assays, and using only the receptor extracellular domain, also may be used to 
advantage in these assays. Functional assays of second messenger systems or 
their sequelae in transfection systems act as assays for binding affinity and 
efficacy in the activation of receptor function or efficacy in the antagonism 
20 of receptor function. Such a transfection system constitutes a “drug 

discovery system", useful for the identification of natural or synthetic 
compounds with potential for drug development that can be further modified or 
used directly as therapeutic compounds to activate or inhibit the natural 
functions of the receptor encoded by the isolated DNA molecule. 

25 Once such candidate drugs (e.g., OP-1 or receptor-binding analogs 

thereof) are identified, they can be produced in reasonable, useful quantities 
using standard methodologies known in the art. Amino acid-based molecules can 
be encoded by synthetic nucleic acid molecules, and expressed in a recombinant 
expression system as described herein above or in the art. Alternatively, 
such molecules can be chemically synthesized, e.g., by means of an automated 
peptide synthesizer, for example. Non-amino acid-based molecules can be 
produced by standard organic chemical synthesis procedures. Where the 
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candidate molecule is of undetermined structure, or composition, its 
composition readily can be determined by, for example, mass spectroscopy. Two 
approaches to identifying analogs typically are practiced in the art: high 

flux screens and rational design of ligand mimetics. High flux screens 
5 typically screen naturally sourced materials or chemical banks for their 

ability to bind a protein of interest, here, e.g., the receptor. Typically, 
compounds are obtained from a range of sources, e.g., chemical banks, 
microbial broths, plant and animal extracts, and the like. In a high flux 
screen typically, purified receptor, preferably the soluble, ligand binding 
10 extracellular domain, is plated onto a microtiter well surface and a standard 
volume of a sample solution to be tested then is added. Also added is a 
standard volume having a known quantity of a purified ligand known to bind the 
receptor with specificity. Preferably the ligand is labelled with a substance 
that is readily detectable by automated means (e.g., radiolabel, chromophoric, 
15 fluorometric, enzymatic or spectroscopic label.) The wells then are washed 
and the amount of label remaining after washing or the amount of label 
remaining associated with the receptor then is detected. Positive scores are 
identified by the ability of the test substance to prevent interaction of the 
labelled ligand with the receptor. The screening assays can be performed 
20 without undue experimentation, using standard molecular and cell biology tools 
in common use in the art. For example, screening assays can be performed in 
standard 96-well plates. Fifteen such plates reasonably can be set up at a 
time to perform multiple screening assays in parallel. Thus, with only 10-11 
reiterations of the screening assay 15,625 (5 6 ) compounds can be screened for 
25 their binding affinity. Even allowing for a maximum incubation time of 2 
hours, all 15,625 compounds reasonably can be assayed in a matter of days. 

High flux screens exploit both the high degree of specificity of the 
labelled ligand for its receptor, as well as high throughput capacity of 
computer driven robotics and computer handling of data. Candidate analogs 
30 identified in this manner, then can be analyzed structurally and this 

information used to design and to synthesize analogs having enhanced potency, 
increased duration of action, increased selectivity and reduced side effects. 
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Candidates also can be used in a rational design program as described below. 
Finally, candidate analogs also can be tested to determine morphogenetic 
effect, if any, as described below. 

The second approach to the identification of analogs uses a rational 
5 design approach to create molecules capable of mimicking the binding effect of 
OP1 with an OPl-specific receptor. Here the relevant structure for receptor 
binding is analyzed to identify critical sequences and structures necessary 
for binding- activity and this information can be used to design and synthesize 
minimal size morphogen analogs. As for candidate compounds in the high flux 
10 assay, design candidates can be tested for receptor binding activity as 

described above. As described above, a candidate sequence can be further 
modified by,' for example standard biological or chemical mutagenesis 
techniques to create a candidate derivative having, for example, enhanced 
binding affinity or another preferred characteristic. 

15 Antibodies capable of interacting specifically with the receptor and 

competing with OP1 binding also can be used as an analog. Antibodies can be 
generated as described above. 

OPl analogs may be evaluated for their ability to mimic OP1 or to 
inhibit OPl binding (e.g., agonists or antagonists) by monitoring the effect 
20 of the analogs on cells bearing an OPl-specific receptor (e.g., ALK-2, ALK-3 
or ALK-6.) OPl agonists are anticipated to have utility in any application 
where tissue morphogenesis is desired, such as in the regeneration of damaged 
tissue resulting from mechanical or chemical trauma, degenerative diseases, 
tissue destruction resulting from chronic inflammation, cirrhosis, 

25 inflammatory diseases, cancer and the like, and in the regeneration of 

tissues, organs and limbs. OPl antagonists are envisioned to have utility in 
applications where tissue morphogenesis is to be limited as, for example, in 
the treatment of malignant transformations including, but not limited to, 
osteosarcomas and Paget's disease. 

30 Several exemplary systems for assaying the ability of a candidate 

analog transduce an OP-l-specif ic signal across the cellular membrane are 



described below. 
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9.1 Induction of Osteoblast Differentiation Markers 

For example, OP1 is known to preferentially induce differentiation of 
progenitor cells, including embryonic mesenchymal cells and primary 
osteoblasts (see, for example, PCT US92/07432) As one example, OP1 analogs 
5 can be tested for their ability to induce differentiation of primary 

osteoblasts, by measuring the ability of these analogs to induce production of 
alkaline phosphatase, PTH-mediated cAMP and osteocalcin, all of which are 
induced when primary osteoblasts are exposed to OP-1, 60A or DPP. 

Briefly, the assays may be performed as follows. In this and all 
10 examples involving osteoblast cultures, rat osteoblast-enriched primary 

cultures preferably are used. Although these cultures are heterogeneous in 
that the individual cells are at different stages of differentiation, these 
cultures are believed to more accurately reflect the metabolism and function 
of osteoblasts in vivo than osteoblast cultures obtained from established cell 
15 lines. Unless otherwise indicated, all chemicals referenced are standard, 

commercially available reagents, readily available from a number of sources, 
including Sigma Chemical, Co., St. Louis; Calbiochera, Corp., San Diego and 
Aldrich Chemical Co., Milwaukee. 

Rat osteoblast-enriched primary cultures are prepared by sequential 
20 collagenase digestion of newborn suture-free rat calvaria (e.g., from 1-2 day- 
old animals, Long-Evans strain, Charles River Laboratories, Wilmington, MA) , 
following standard procedures, such as are described, for example, in Wong et 
al . , (1975) PNAS 72:3167-3171. Rat osteoblast single cell suspensions then 

are plated onto a multi-well plate (e.g., a 24 well plate) at a concentration 
25 of 50,000 osteoblasts per well in alpha MEM (modified Eagle's medium, Gibco, 
Inc., Long Island) containing 10% FBS (fetal bovine serum), L-glutamine and 
penicillin/streptomycin. The cells are incubated for 24 hours at 37 °C, at 
which time the growth medium is replaced with alpha MEM containing 1% FBS and 
the cells incubated for an additional 24 hours so that the cells are in serum- 
30 deprived growth medium at the time of the experiment. 

Alkaline Phosphatase Induction of Osteoblasts 
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The cultured cells in serum- free medium are incubated with OP1, OP1 
analog or a negative control, using a range of concentrations. For example, 
0.1, 1.0, 10.0, 40.0 or 80.0 ng OP-l/ml medium typically are used. 72 hours 
after the incubation period the cell layer is extracted with 0.5 ml of 1% 

5 Triton X-100. The resultant cell extract then, is centrifuged, and 100 ml of 
the extract is added to 90 ml of paranitrosophenylphospate (PNPP) /glycerine 
mixture and incubated for 30 minutes in a 37 °C water bath and the reaction 
stopped with 100 ml NaOH. The samples then are run through a plate reader 
(e.g., Dynatech MR700 plate reader, and absorbance measured at 400 nm, using 
10 p-nitrophenol as a standard) to determine the presence and amount of alkaline 
phosphate activity. Protein concentrations are determined by the Biorad 
method. Alkaline phosphatase activity is calculated in units/mg protein, 
where 1 unit=l nmol p-nitrophenol liberated/30 minutes at 37 °C. OP-1 induces 
a five -fold increase in the specific activity of alkaline phosphate by this 
15 method. Agonists are expected to have similar induction effects. Antagonists 
should inhibit or otherwise interfere with OP1 binding, and diminished 
alkaline phophatase induction should result when the assay is performed with 
an antagonist in the presence of a limiting amount of OP1 . 

Induction of PTH-Medlated cAMP . 

20 The effect of a morphogen analog on parathyroid hormone-mediated cAMP 

production in rat osteoblasts in vitro may be demonstrated as follows. 

Rat osteoblasts are prepared and cultured in a multiwell plate as 
described above. The cultured cells then are divided into three groups: (1) 

wells which receive, for example, 1.0, 10.0 and 40.0 ng OP-l/ml medium); (2) 

25 wells which receive the candidate analog at various concentration ranges; and 
(3) a control group which receives no additional factors. The plate is then 
incubated for another 72 hours. At the end of the 72 hours the cells are 
treated with medium containing 0.5% bovine serum albumin (BSA) and lmM 3- 
isobutyl-1 -me thylxan thine for 20 minutes followed by the addition into half of 
30 the wells of human recombinant parathyroid hormone (hPTH, Sigma, St. Louis) at 
a concentration of 200 ng/ml for 10 minutes. The cell layer then is extracted 
from each well with 0.5 ml of 1% Triton X-100. The cAMP levels then are 
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determined using a radioimmunoassay kit (e.g., Amersham, Arlington Heights, 
Illinois) . OP-1 doubles cAMP production in the presence of PTH. Agonists 
are expected to have similar induction effects. Antagonists are expected to 
inhibit or otherwise interfere with OP1 binding, and diminished cAMP 
5 production should result when the assay is performed with an antagonist in the 
presence of limiting the amount of OPl. 

Induction of Osteocalcin Production 

Osteocalcin is a bone-specific protein synthesized by osteoblasts which 
plays an integral role in the rate of bone mineralization in vivo . 

10 Circulating levels of osteocalcin in serum axe used as a marker for osteoblast 
activity and bone formation in vivo . Induction of osteocalcin synthesis in 
osteoblast-enriched cultures can be used to demonstrate morphogenic efficacy 
in vitro . 

Rat osteoblasts are prepared and cultured in a multi-well plate as 
15 above. In this experiment the medium is supplemented with 10% FBS , and on day 
2, cells are fed with fresh medium supplemented with fresh 10 mM b- 
glycerophosphate (Sigma, Inc.). Beginning on day 5 and twice weekly 
thereafter, cells are fed with a complete mineralization medium containing all 
of the above components plus fresh L (+) -ascorbate, at a final concentration of 
20 50mg/ml medium. OPl or OPl analog then is added to the wells directly, e.g., 
in 50% acetonitrile (or 50% ethanol) containing 0.1% trif luoroacetic acid 
(TFA) , at no more than 5ml OPl /ml medium. Control wells receive solvent 
vehicle only. The cells then are re- fed and the conditioned medium sample 
diluted 1:1 in standard radioimmunoassay buffer containing standard protease 
25 inhibitors and stored at -20°C until assayed for osteocalcin. Osteocalcin 
synthesis is measured by standard radioimmunoassay using a commercially 
available osteocalcin-specific antibody and car. be confirmed by Northern blot 
analysis to calculate the amount of osteocalcin mRNA produced in the presence 
and absence of OP-1 or OPl analog. OP-1 induces a dose-dependent increase in 
30 osteocalcin production (5-fold increase using 25 ng of OP-1 protein/ml) , and a 
20- fold increase in osteocalcin mRNA. Agonists are expected to have similar 
induction effects; antagonists are expected to inhibit or otherwise interfere 
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with OP1 binding, thereby substantially interfering with osteocalcin induction 
in the presence of a limiting amount of OP1. 

Mineralization is determined on long term cultures (13 day) using a 
modified von Kossa staining technique on fixed cell layers: cells are fixed 

5 in fresh 4% paraformaldehyde at 23o C for 10 min, following rinsing cold 0.9% 
NaCl. Fixed cells then are stained for endogenous alkaline phosphatase at pH 
9.5 for 10 min, using a commercially available kit (Sigma, Inc.) Purple 
stained cells then are dehydrated with methanol and air dried. After 30 min 
incubation in 3% AgNOj in the dark, ^O-rinsed samples are exposed for 30 sec 

10 to 254 nm UV light to develop the black silver-stained phosphate nodules. 
Individual mineralized foci (at least 20 mm in size) are counted under a 
dissecting microscope and expressed as nodules/culture. OP-1 induces a 20- 
fold increase in initial mineralization rate. Agonists are expected to have 
similar induction effects; antagonists are expected to inhibit or otherwise 
15 interfere with OPl binding, thereby inhibiting mineralization induction in 
the presence of a limiting amount of OPl. 

9.2 Induction of a Constructed Reporter Gene 

Alternatively, a reporter gene construct can be used to determine the 
ability of candidate molecule to induce signal transduction across a membrane 
20 following receptor binding. For example, PAI-1 protein, (Plasminogen 

Activator Inhibitor-1) expression can be induced by OP-1 in MvlLu- cells (see 
above). Also, as demontrated above, these cells express ALK-2 , -3 and -6 
surface receptors. In addition, preliminary studies indicate that ALK-1 , when 
overexpressed in a chemically mutagenized derivative of these cells, also 
25 apparently mediates PAI-1 induction in the presence of OPl. 

Accordingly, PAI-1 promoter elements can be fused to a reporter gene 
and induction of the reporter gene monitored following incubation of the 
transfected cell with a candidate analog. As one example, the lucif erase 
reporter gene can be used, in, for example, the construct p3TP-Lux described 
30 by Wrana et al . (1992) Cell 71: 1003-1014 and Attisano et al. (1993) Cell 74: 

671-680. This reporter gene construct includes a region of the human PAI-1 
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gene promoter in combination with three sets of tetradecanoyl phorbol - 
acetate responsive elements upstream of the lucifrase open reading frame. 

In a typical assay, transfected cells starved in DMEM containing 0.1% 
fetal bovine serum and antibiotics (e.g., 100 units/ml penicillin and 50 jig/ml 
5 streptomycin) for 6 hrs . , and then exposed to ligand for 24 hr. Lucif erase 
activity in the cell lysate then is measured using a luminometer in the 
luciferase assay system, according to the manufacturer's protocol (Promega) . 

In MvlLu mutant cells, *R mutant" cells co-transfected with ALK-2 and Act RII, 
OP1 mediated induction of luciferase activity. 

10 9.3 Inhibition of Epithelial cell proliferation 

OP1 is known to inhibit epithelial cells. Thus, the ability of a 
candidate analog to inhibit cell proliferation, as measured by 3 H- thymidine 
uptake by an epithelial cell can be used in an assay to evaluate signal 
transduction activity of the candidate. Analogs competent to inhibit 
15 epithelial cell growth are contemplated to have particular utility in 

therapeutic applications where limitation of a proliferating cell population 
is desired. Such applications include chemotherapies and radiation therapies 
where limiting the growth of a normally proliferating population of cells can 
protect these cells from the cytotoxic effects of: these cancer therapies. 

20 (see e.g., W094/06420) . In addition, psoriasis and other tissue disorders 

resulting from uncontrolled cell proliferation, including benign and malignant 

neoplasties, can be modulated by use of an OP1 analog. 

As an example, mink lung epithelial cell growth is inhibited by OP-1, 
(see, PCT US93/088S5,- W094/06420.) As described above, derivatives of these 
25 cells [e.g., *R-4 mutants", clone 4-2, Laiho et al. (1990) J. Biol. Chem. 265 : 

18518-18524] can be transfected with DNA encoding OPl-specific receptors and 
induced to express these receptors. The transfected cells, then can be 
assayed for a candidate analog's ability to block cell growth. As one 
example,- when R-4 cells are transfected with ALK-3 under a Zn 2 *- inducible 
30 promoter, and induced to express the receptor following induction with ZnCl 2 , 
cell growth can be inhibited in the presence of OP1 in a dose dependent 
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manner. Preliminary experiments with ALK-1 indicates that this receptor also 
can mediate this OP-l-specif ic effect. 

In a typical assay, cells are seeded in 24-well cell culture plates at 
a density of 10 4 cells per well in DMEM with 10 % F3S , and incubated 
5 overnight. The medium is replaced with DMEM containing 0.2 % FBS and lOOuM 

ZnLC 2 , and the cells are incubated for 5 h, after which the medium is replaced 
with fresh DMEM containing 0.2 % FBS, lOOuM ZnCL 2 and various concentrations 
of OP-1 or an analog candidate. After 16 h of incubation, 0.25 ci of 

[ 3 H] thymidine (Amersham) are added and the cells incubated for an additional 2 
10 h. Thereafter, the cells are fixed in 10% trichloroacetic acid for more than 
15 min on ice, and solubilized with 1 M NaOH. The cell extracts are 

neutralized with 1 M HC1 and 3 H radioactivity determined in a liquid 
scintillation counter. 

Example 10. SCREENING ASSAY FOR COMPOUNDS WHICH ALTER ENDOGENOUS OP1 RECEPTOR 
15 EXPRESSION LEVELS 

Candidate compound (s) which can be administered to affect the level of 
a given endogenous OP1 receptor can be found using the following screening 
assay, in which the level of OP1 receptor production by a cell type which 
produces measurable levels of the receptor is determined by incubating the 
cell in culture with and without the candidate compound, in order to assess 
the effects of the compound on the cell. This also can be accomplished by 
detection of the OPl receptor either at the protein level by Western blot or 
immunolocalization, or at the RNA level by Northern blot or in situ 
hydridization. The protocol is based on a procedure for identifying compounds 
which alter endogenous levels of OPl expression, a detailed description also 
may be found in PCT US 92/07359, incorporated herein by reference. 

Cell cultures of, for example, bone, brain, intestine, lung, heart, 
eye, breast, gonads, kidney, adrenals, urinary bladder, brain, or other 
organs, may be prepared as described widely in the literature. For example, 
kidneys may be explanted from neonatal or new born or. young or adult rodents 
(mouse or rat) and used in organ culture as whole or sliced (1-4 mm) tissues. 



30 
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Primary tissue cultures and established cell lines, also derived from kidney, 
adrenals, urinary, bladder, brain, mammary, or other tissues may be 
established in multiwell plates (6 well or 24 well) according to conventional 
cell culture techniques, and are cultured in the absence or presence of serum 
5 for a period of time (1-7 days) . Cells can be cultured, for example, in 
Dulbecco’s Modified Eagle medium (Gibco, Long Island, NY) containing serum 
(e.g., fetal calf serum at 1%-10%, Gibco) or in serum- deprived medium, as 
desired, or in defined medium (e.g., containing insulin, transferrin, glucose, 
albumin, or other growth factors) . 

10 Cell samples for testing the level of 0P1 receptor production are 

collected periodically and evaluated for receptor production by immunoblot 
analysis (Sambrook et al . , eds . , 1989, Molecular Cloning, Cold Spring Harbor 
Press, Cold Spring Harbor, NY), or, alternatively, a portion of the cell 
culture itself can be collected periodically and used to prepare polyA+ RNA 
15 for mRNA analysis by Northern blot analysis. To monitor de novo receptor 

synthesis, some cultures are labeled according to conventional procedures with 

an 35 S-methionine/ 35 S-cysteine mixture for 6-24 hours and then evaluated to 
quantitate receptor synthesis by conventional immunoassay methods. 
Alternatively, anti -receptor antibodies may be labelled and incubated with the 
20 cells or cell lysates, and the bound complexes detected and quantitated by 

conventional means, such as those described hereinabove. Northern blots may 
be performed using a portion of the OPl receptor coding sequence to create 
hybridization probes, and following the RNA hybridization protocol described 
herein . 

25 Example 11 . GENERAL FORMULATION/ ADMINISTRATION CONSIDERATIONS 

The analogs and constructs described herein can be provided to an 
individual as part of a therapy to enhance, inhibit, or othewise modulate the 
in vivo binding interaction between OPl and one or more OPl -specific cell 
surface receptors. The molecules then comprise part of a pharmaceutical 
30 composition as described herein below and can be administered by any suitable 
means, preferably directly or systemically, e.g., parenterally or orally. 

Where the therapeutic molecule is to be provided directly (e.g., locally, as 
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by injection, to a desired tissue site) , or parenterally, such as by 
intravenous, subcutaneous, intramuscular, intraorbital, ophthalmic, 
intraventricular, intracranial, intracapsular , intraspinal, intracis ternal , 
intraperitoneal , buccal, rectal, vaginal, intranasal or by aerosol 
5 administration, the therapeutic preferably comprises part of an aqueous 

solution. The solution preferably is physiologically acceptable so that in 
addition to delivery of the desired morphogen to the patient, the solution 
does not otherwise adversely affect the patient's electrolyte and volume 
balance. The aqueous medium for the therapeutic molecule thus may comprise 
10 normal physiologic saline (0.9% NaCl, 0.15M), pH 7-7.4 or other 
pharmaceutical ly acceptable salts thereof. 

Useful solutions for oral or parenteral administration can be prepared 
by any of the methods well known in the pharmaceutical art, described, for 
example, in Remington's Pharmaceutical Sciences , (Gennaro, A., ed.), Mack 
15 Pub., 1990. Formulations may include, for example, polyalkylene glycols such 
as polyethylene glycol, oils of vegetable origin, hydrogenated naphthalenes, 
and the like. Formulations for direct administration, in particular, can 
include glycerol and other compositions of high viscosity. Biocompatible, 1 
preferably bioresorbable polymers, including, for example, hyaluronic acid, 

20 collagen, tricalcium phosphate, polybutyrate, polylactide, polyglycolide and 
lactide/glycolide copolymers, may be useful excipients to control the release 
of the morphogen in vivo . 

Other potentially useful parenteral delivery systems for these 
therapeutic molecules include ethylene -vinyl acetate copolymer particles, 

25 osmotic pumps, implantable infusion systems, and liposomes. Formulations for 
inhalation administration may contain as excipients, for example, lactose, or 
can be aqueous solutions containing, for example, polyoxyethylene- 9-lauryl 
ether, glycocholate and deoxycholate, or oily solutions for administration in 
the form of nasal drops, or as a gel to be applied intranasally . 

30 Alternatively, the morphogens described herein may be administered 



orally . 
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The therapeutic molecules also can be associated with means for 
targeting the therapeutic to a desired tissue. For example, tetracycline and 
diphosphonates (bisphosphonates) are known to bind to bone mineral, 
particularly at zones of bone remodeling, when they are provided systemically 
5 in a mammal. Accordingly, these molecules may be included as useful agents 
for targeting therapeutics to bone tissue. Alternatively, an antibody or 
other binding protein that interacts specifically with a surface molecule on 
the desired target tissue cells also can be used. Such targeting molecules 
further can be covalently associated to the therapeutic molecule e.g., by 
10 chemical crosslinking, or by using standard genetic engineering means to 

create, for example, an acid labile bond such as an Asp- Pro linkage. Useful 
targeting molecules can be designed, for example, using the single chain 
binding site technology disclosed, for example, in U.S. Pat. No. 5,091,513. 

Finally, therapeutic molecules can be administered alone or in 
15 combination with other molecules known to have a beneficial effect on tissue 
morphogenesis, including molecules capable of tissue repair and regeneration 
and/or inhibiting inflammation. Examples of useful cofactors for stimulating 
bone tissue growth in osteoporotic individuals, for example, include but are 
not limited to, vitamin D 3 , calcitonin, prostaglandins, parathyroid hormone, 

20 dexamethasone, estrogen and IGF-I or IGF-II . Useful cofactors for nerve 
tissue repair and regeneration can include nerve growth factors . Other 
useful cofactors include symptom- alleviating cofactors, including antiseptics, 
antibiotics, antiviral and antifungal agents and analgesics and anesthetics. 

Therapeutic molecules further can be formulated into pharmaceutical 
25 compositions by admixture with pharmaceutically acceptable nontoxic excipients 
and carriers. As noted above, such compositions can be prepared for 
parenteral administration, particularly in the form of liquid solutions or 
suspensions; for oral administration, particularly in the form of tablets or 
capsules; or intranasally, particularly in the form of powders, nasal drops or 
30 aerosols. Where adhesion to a tissue surface is desired the composition may 
include the morphogen dispersed in a fibrinogen-thrombin composition or other 
bioadhesive such as is disclosed, for example in PCT US91/09275, the 
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disclosure of which is incorporated herein by reference. The composition then 
can be painted, sprayed or otherwise applied to the desired tissue surface. 

The compositions can be formulated for parenteral or oral 
administration to humans or other mammals in therapeutically effective 
5 amounts, e.g., amounts which provide appropriate concentrations of the analog 
to target tissue for a time sufficient to induce the desired effect. 

Where the analog is to be used as part of a transplant procedure, it 
can be provided to the living tissue or organ to be transplanted prior to 
removal of tissue or organ from the donor. The analog may be provided to the 
10 donor host directly, as by injection of a formulation comprising the analog 
into the tissue, or indirectly, e.g., by oral or parenteral administration, 
using any of the means described above. 

Alternatively or, in addition, once removed from the donor, the organ 
or living tissue can be placed in a preservation solution containing the 
15 therapeutic molecule. In addition, the recipient also preferably is provided 
with the analog just prior to, or concommi tant with, transplantation. In all 
cases, the analog can be administered directly to the tissue at risk, as by 
injection to the tissue, or it may be provided systemically , either by oral or 
parenteral administration, using any of the methods and formulations described 
20 herein and/or known in the art. 

Where the therapeutic molecule comprises part of a tissue or organ 
preservation solution, any commercially available preservation solution can 
be used to advantage. For example, useful solutions known in the art include 
Collins solution, Wisconsin solution, Belzer solution, Eurocollins solution 
25 and lactated Ringer's solution. Generally, an organ preservation solution 
usually possesses one or more of the following properties: (a) an osmotic 

pressure substantially equal to that of the inside of a mammalian 
cell, (solutions typically are hyperosmolar and have K+ and/or Mg++ ions 
present in an amount sufficient to produce an osmotic pressure slightly higher 
30 than the inside of a mammalian cell) ; (b) the solution typically is capable of 

maintaining substantially normal ATP levels in the cells; and (c) the solution 
usually allows optimum maintenance of glucose metabolism in the cells. Organ 
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preservation solutions also may contain anticoagulants, energy sources such as 
glucose, fructose and other sugars, metabolites, heavy metal chelators, 
glycerol and other materials of high viscosity to enhance survival at low 
temperatures, free oxygen radical inhibiting and/or scavenging agents and a pH 
5 indicator. A detailed description of preservation solutions and useful 
components can be found, for example, in US Patent No. 5,002,965, the 
disclosure of which is incorporated herein by reference. 

As will be appreciated by those skilled in the art, the concentration 
of the compounds described in a therapeutic composition will vary depending 
10 upon a number of factors, including the dosage of the drug to be administered, 
the chemical characteristics (e.g., hydrophobic ity) of the compounds employed, 
and the route of administration. The preferred dosage of drug to be 
administered also is likely to depend on such variables as the type and extent 
of tissue loss or defect, the overall health status of the particular patient, 
15 the relative biological efficacy of the compound selected, the formulation of 
the compound, the presence and types of excipients in the formulation, and the 
route of administration. In general terms, the therapeutic molecules of this 
invention may be provided to and individual where typical dose ranges are from 
about 10 ng/kg to about X g/kg of body weight per day; a preferred dose range 
20 being from about 0.1 mg /kg to 100 mg/kg of body weight. No obvious morphogen- 
induced pathological lesions are induced when mature morphogen (e.g., OP-1, 20 
mg) is administered daily to normal growing rats for 21 consecutive days. 
Moreover, 10 mg systemic injections of morphogen (e.g. , OP-1) injected daily 
for 10 days into normal newborn mice does not produce any gross abnormalities. 
25 Other Embodiments 

The invention may be embodied in other specific forms without departing 
from the spirit or essential characteristics thereof. The present embodiments 
are therefore to be considered in all respects as illustrative and not 
restrictive, the scope of the invention being indicated by the appended claims 
30 rather than by the foregoing description, and all changes which come within 
the meaning and range of equivalency of the claims are therefore intended to 



be embraced therein. 
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SEQUENCE LISTING 



(I) GENERAL INFORMATION: 

(i) APPLICANT: 

(B) NAME: CREATIVE BIOMOLCULES, INC 

(C) STREET: 45 SOUTH STREET 

(D) STATE: HOPKINTON 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 01748 

(G) TELEPHONE: (508) -435-9001 

(H) TELEFAX: ( 508 ) -435 -0992 

(I) TELEX: 

(i) APPLICANT: 

(B) NAME: LUDWIG INSTITUTE FOR CANCER RESEARCH 

(C) STREET: 1345 AVENUE OF THE AMERICAS 

(D) STATE: NY 

(E) . COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 10105 

(G) TELEPHONE: (212 ) -765-3000 

(H) TELEFAX: 

(I) TELEX: 

(ii) TITLE OF INVENTION: MORPHOGENIC PROTEIN-SPECIFIC CELL 
SURFACE RECEPTORS AND USES THEREFOR 

(iii) NUMBER OF SEQUENCES: 15 
30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PATENT ADMINISTRATOR, TESTA, HURWITZ & 

THIBEAULT 

(B) STREET: 53 STATE STREET 

35 (C) CITY: BOSTON 

(D) STATE: MA 
■ (E) COUNTRY: USA 

(F) ZIP: 02109 

40 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
45 

(vi) CURRENT . APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

50 

(viii ) ATTORNEY/AGENT INFORMATION: 

(A) NAME: PITCHER, EDMUND R. 

(B) REGISTRATION NUMBER: 27,829 

(C) REFERENCE/DOCKET NUMBER: CRP-097PC 
55 

(ix) TELECOMMUNICATION INFORMATION: 

: (A) TELEPHONE: (617) 248-7000 

(B) TELEFAX: (617) 248-7100 



5 



10 



15 



20 



25 



60 
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(2) INFORMATION FOR SEQ ID NOsl: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

10 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1509 

15 (D) OTHER INFORMATION: /product= ‘Human ALK1 " 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

20 ATG ACC TTG GGC TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC 48 

Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala 

1 5 10 . 15 

TTG GTG ACC CAG GGA GAC CCT GTG AAG CCG TCT CGG GGC CCG CTG GTG 96 

25 Leu Val Thr Gin Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 

20 25 30 

ACC TGC ACG TGT GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG 144 

Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 

30 .35 40 45 

GCC TGG TGC ACA GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG 192 

Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin 

50 55 60 

35 

GAA CAT CGG GGC TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC 240 

Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 

65 70 75 80 

40 CCC ACC GAG TTC GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC 288 

Pro Thr Glu Phe Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn 

85 90 95 

CAC AAC GTG TCC CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG 336 

45 His Asn Val Ser Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin 

100 105 110 

CCG GGA ACA GAT GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC 384 

Pro Gly Thr Asp Gly Gin Leu Ala Leu lie Leu Gly Pro Val Leu Ala 

50 115 120 125 

TTG CTG GCC CTG GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA 432 

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 

130 135 140 

55 

CGG AGG CAG GAG AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC 4 80 

Arg Arg Gin Glu Lys Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser 

145 150 155 160 

60 AGT CTC ATC CTG AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC 528 
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Ser 


Leu 


lie 


Leu 


Lys 

165 


Ala Ser 


Glu 


Gin 


Gly 

170 


Asp 


Thr 


Met 


Leu 


Gly 

175 


Asp 




CTC 


CTG 


GAC 


AGT 


GAC 


TGC ACC 


ACA 


GGG 


AGT 


GGC 


TCA 


GGG 


CTC 


CCC 


TTC 


5 


Leu 


Leu 


Asp 


Ser 

180 


Asp 


Cys Thr 


Thr 


Gly 

185 


Ser 


Gly 


Ser 


Gly Leu 
190 


Pro 


Phe 




CTG 


GTG 


CAG 


AGG 


ACA 


GTG GCA 


CGG 


CAG 


GTT 


GCC 


TTG 


GTG 


GAG 


TGT 


GTG 


10 


Leu 


Val 


Gin 

195 


Arg 


Thr 


Val Ala 


Arg 

200 


Gin 


Val 


Ala 


Leu 


Val 

205 


Glu 


Cys 


Val 




GGA 


AAA 


GGC 


CGC 


TAT 


GGC GAA 


GTG 


TGG 


CGG 


GGC 


TTG 


TGG 


CAC 


GGT 


GAG 


15 


Gly 


Lys 

210 


Gly Arg 


Tyr 


Gly Glu 
215 


Val 


Trp 


Arg 


Gly 


Leu 

220 


Trp 


His 


Gly 


Glu 


AGT 


GTG 


GCC 


GTC 


AAG 


ATC TTC 


TCC 


TCG 


AGG 


GAT 


GAA 


CAG 


TCC 


TGG 


TTC 




Ser 

225 


Val 


Ala 


Val 


Lys 


lie Phe 
230 


Ser 


Ser 


Arg 


Asp 

235 


Glu 


Gin 


Ser 


Trp 


Phe 

240 


20 


CGG 


GAG 


ACT 


GAG 


ATC 


TAT AAC 


ACA 


GTA 


TTG 


CTC 


AGA 


CAC 


GAC 


AAC 


ATC 




Arg 


Glu 


Thr 


Glu 


lie 

245 


Tyr Asn 


Thr 


Val 


Leu 

250 


Leu 


Arg 


His 


Asp 


Asn 

255 


lie 




CTA 


GGC 


TTC 


ATC 


GCC 


TCA < GAC 


ATG 


ACC 


TCC 


CGC 


AAC 


TCG 


AGC 


ACG 


CAG 


25 


Leu 


Gly 


Phe 


lie 

260 


Ala 


Ser' Asp 


Met 


Thr 

265 


Ser 


Arg 


Asn 


Ser 


Ser 

270 


Thr 


Gin 




CTG 


TGG 


CTC 


ATC 


ACG 


CAC TAC 


CAC 


GAG 


CAC 


GGC 


TCC 


CTC 


TAC 


GAC 


TTT 


30 


Leu 


Trp 


Leu 

275 


lie 


Thr 


His Tyr 


His 

280 


Glu 


His 


Gly 


Ser 


Leu 

285 


Tyr 


Asp 


Phe 




CTG 


CAG 


AGA 


CAG 


ACG 


CTG GAG 


CCC 


CAT 


CTG 


GCT 


CTG 


AGG 


CTA 


GCT 


GTG 


35 


Leu 


Gin 

290 


Arg 


Gin 


Thr 


Leu Glu 
295 


Pro 


His 


Leu 


Ala 


Leu. 

300 


Arg 


Leu 


Ala 


Val 


TCC 


GCG 


GCA 


TGC 


GGC 


CTG GCG 


CAC 


CTG 


CAC 


GTG 


GAG 


ATC 


TTC 


GGT 


ACA 




Ser 

305 


Ala 


Ala 


Cys 


Gly 


Leu Ala 
310 


His 


Leu 


His 


Val 

315 


Glu 


lie 


Phe 


Gly 


Thr 

320 


40 


CAG 


GGC 


AAA 


CCA 


GCC 


ATT GCC 


CAC 


CGC 


GAC 


TTC 


AAG 


AGC 


CGC 


AAT 


GTG 




Gin 


Gly 


Lys 


Pro 


Ala 

325 


lie Ala 


His 


Arg 


Asp 

330 


Phe 


Lys 


Ser 


Arg 


Asn 

335 


Val 




CTG 


GTC 


AAG 


AGC 


AAC 


CTG CAG 


TGT 


TGC 


ATC 


GCC 


GAC 


CTG 


GGC 


CTG 


GCT 


45 


Leu 


Val 


Lys 


Ser 

340 


Asn 


Leu Gin 


Cys 


Cys 

345 


lie 


Ala 


Asp 


Leu Gly 
350 


Leu 


Ala 




GTG 


ATG 


CAC 


TCA 


CAG 


GGC AGC 


GAT 


TAC 


CTG 


GAC 


ATC 


GGC 


AAC 


AAC 


CCG 


50 


Val 


Met 


His 

355 


Ser 


Gin 


Gly Ser 


Asp 

360 


Tyr 


Leu 


Asp 


lie 


Gly Asn 
365 


Asn 


Pro 




AGA 


GTG 


GGC 


ACC 


AAG 


CGG TAC 


ATG 


GCA 


CCC 


GAG 


GTG 


CTG 


GAC 


GAG 


CAG 




Arg 


Val 


Gly Thr 


Lys 


Arg Tyr 


Met 


Ala 


Pro 


Glu 


Val 


Leu 


Asp 


Glu 


Gin 



370 375 380 



55 

AT C CGC ACG GAC TGC TTT GAG TCC TAC AAG TGG ACT GAC ATC TGG GCC 

lie Arg Thr Asp Cys Phe Glu Ser Tyr Lys Trp Thr Asp lie Trp Ala 

385 390 395 400 

TTT GGC CTG GTG CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC 



576 

624 

672 

720 

768 

816 

864 

912 

960 

1008 

1056 

1104 

1152 

1200 



60 



1248 
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Phe Gly 


Leu 


Val 


Leu 


Trp 


Glu 


lie 


Ala 


Arg Arg 


Thr 


lie 


Val 


Asn 


Gly 














405 










410 










415 






ATC GTG 


GAG 


GAC 


TAT 


AGA 


CCA 


CCC 


TTC 


TAT 


GAT 


GTG 


GTG 


CCC 


AAT 


GAC 


1296 


5 


lie 


Val 


Glu 


Asp 


Tyr 


Arg 


Pro 


Pro 


Phe 


Tyr Asp Val 


Val 


Pro 


Asn 


Asp 












420 










425 










430 








CCC 


AGC 


TTT 


GAG 


GAC 


ATG 


AAG 


AAG 


GTG 


GTG 


TGT 


GTG 


GAT 


CAG 


CAG 


ACC 


1344 




Pro 


Ser 


Phe 


Glu 


Asp 


Met 


Lys 


Lys 


Val 


Val 


Cys 


Val 


Asp 


Gin 


Gin 


Thr 




10 






435 










440 










445 












CCC 


ACC 


ATC 


CCT 


AAC 


CGG 


CTG 


GCT 


GCA 


GAC 


CCG 


GTC 


CTC 


TCA 


GGC 


CTA 


1392 




Pro 


Thr 


He 


Pro 


Asn 


Arg 


Leu 


Ala 


Ala 


Asp 


Pro 


Val 


Leu 


Ser 


Gly 


Leu 




15 




450 










455 










460 










GCT 


CAG 


ATG 


ATG 


CGG 


GAG 


TGC 


TGG 


TAC 


CCA 


AAC 


CCC 


TCT 


GCC 


CGA 


CTC 


1440 




Ala 


Gin 


Met 


Met 


Arg 


Glu 


Cys 


Trp 


Tyr 


Pro 


Asn 


Pro 


Ser 


Ala 


Arg 


Leu 






465 










470 










475 








480 




20 


ACC 


GCG 


CTG 


CGG 


ATC 


AAG 


AAG 


ACA 


CTA 


CAA 


AAA 


ATT 


AGC 


AAC 


AGT 


CCA 


1488 




Thr 


Ala 


Leu 


Arg 


lie 


Lys 


Lys 


Thr 


Leu 


Gin 


Lys 


lie 


Ser 


Asn 


Ser 


Pro 














485 










490 










495 








GAG 


AAG 


CCT 


AAA 


GTG 


ATT 


CAA 




















1509 


25 


Glu 


Lys 


Pro 


Lys 


Val 


lie 


Gin 























500 



(2) INFORMATION FOR SEQ ID NO: 2: 

30 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



40 


Met 

1 


Thr 


Leu 


Gly 


Ser 

5 


Pro 


Arg 


Lys 


Gly 


Leu 

10 


Leu 


Met 


Leu 


Leu 


Met 

15 


Ala 


45 


Leu 


Val 


Thr 


Gin 

20 


Gly 


Asp 


Pro 


Val 


Lys 

25 


Pro 


Ser 


Arg 


Gly 


Pro 

30 


Leu 


Val 




Thr 


Cys 


Thr 

35 


Cys 


Glu 


Ser 


Pro 


His 

40 


Cys 


Lys 


Gly 


Pro 


Thr 

45 


Cys 


Arg 


Gly 


50 


Ala 


Trp 

50 


Cys 


Thr 


Val 


Val 


Leu 

55 


Val 


Arg 


Glu 


Glu 


Gly 

60 


Arg 


His 


Pro 


Gin 




Glu 

65 


His 


Arg 


Gly 


Cys 


Gly 

70 


Asn 


Leu 


His 


Arg 


Glu 

75 


Leu 


Cys 


Arg 


Gly 


Arg 

80 


55 


Pro 


Thr 


Glu 


Phe 


Val 

85 


Asn 


His 


Tyr 


Cys 


Cys 

90 


Asp 


Ser 


His 


Leu 


Cys 

95 


Asn 


60 


His 


Asn 


Val 


Ser 

100 


Leu 


Val 


Leu 


Glu 


Ala 

105 


Thr 


Gin 


Pro 


Pro 


Ser 

110 


Glu 


Gin 
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Pro 


Gly 


Thr 

115 


Asp 


Gly 


Gin Leu 


Ala 

120 


Leu 


lie 


Leu 


Gly 


Pro 

125 


Val 


Leu 


Ala 


5 


Leu 


Leu 

130 


Ala 


Leu 


Val 


Ala Leu 
135 


Gly 


Val 


Leu 


Gly 


Leu 

140 


Trp 


His 


Val 


Arg 




Arg 

145 


Arg 


Gin 


Glu 


Lys 


Gin Arg 
150 


Gly 


Leu 


His 


Ser 

155 


Glu 


Leu 


Gly 


Glu 


Ser 

160 


10 


Ser 


Leu 


lie 


Leu 


Lys 

165 


Ala Ser 


Glu 


Gin 


Gly 

170 


Asp 


Thr 


Met 


Leu 


Gly 

175 


Asp 




Leu 


Leu 


Asp 


Ser 

180 


Asp 


Cys Thr 


Thr 


Gly 

185 


Ser 


Gly 


Ser 


Gly 


Leu 

190 


Pro 


Phe 


15 




Leu 


Val 


Gin 

195 


Arg 


Thr 


val Ala 


Arg 

200 


Gin 


Val 


Ala 


Leu 


Val 

205 


Glu 


Cys 


Val 


20 


Gly 


Lys 

210 


Gly 


Arg 


Tyr 


Gly Glu 
215 


Val 


Trp 


Arg 


Gly 


Leu 

220 


Trp 


His 


Gly 


Glu 




Ser 

225 


Val 


Ala 


Val 


Lys 


lie Phe 
230 


Ser 


Ser 


Arg 


Asp 

235 


Glu 


Gin 


Ser 


Trp 


Phe 

240 


25 


Arg 


Glu 


Thr 


Glu 


lie 

245 


Tyr Asn 


Thr 


Val 


Leu 

250 


Leu 


Arg 


His 


Asp 


Asn 

255 


lie 




Leu 


Gly 


Phe 


lie 

260 


Ala 


Ser Asp 


Met 


Thr 

265 


Ser 


Arg 


Asn 


Ser 


Ser 

270 


Thr 


Gin 


30 




Leu 


Trp 


Leu 

275 


lie 


Thr 


His Tyr 


His 

280 


Glu 


His 


Gly 


Ser 


Leu 

285 


Tyr 


Asp 


Phe 


35 


Leu 


Gin 

290 


Arg 


Gin 


Thr 


Leu Glu 
295 


Pro 


His 


Leu 


Ala 


Leu 

300 


Arg 


Leu 


Ala 


Val 




Ser 

305 


Ala 


Ala 


Cys 


Gly 


Leu Ala 
310 


His 


Leu 


His 


Val 

315 


Glu 


lie 


Phe 


Gly 


Thr 

320 


40 


Gin 


Gly 


Lys 


Pro 


Ala 

325 


lie Ala 


His 


Arg 


Asp 

330 


Phe 


Lys 


Ser 


Arg 


Asn 

335 


Val 




Leu 


Val 


Lys 


Ser 

340 


Asn 


Leu Gin 


Cys 


Cys 

345 


lie 


Ala 


Asp 


Leu 


Gly 

350 


Leu 


Ala 


45 




Val 


Met 


His 

355 


Ser 


Gin 


Gly Ser 


Asp 

360 


Tyr 


Leu 


Asp 


He 


Gly 

365 


Asn 


Asn 


Pro 


50 


Arg 


Val 

370 


Gly 


Thr 


Lys 


Arg Tyr 
375 


Met 


Ala 


Pro 


Glu 


Val 

380 


Leu 


Asp 


Glu 


Gin 




lie 

385 


Arg 


Thr 


Asp 


Cys 


Phe Glu 
390 


Ser 


Tyr 


Lys 


Trp 

395 


Thr 


Asp 


lie 


Trp 


Ala 

400 


55 


Phe 


Gly 


Leu 


Val 


Leu 

405 


Trp Glu 


lie 


Ala 


Arg 

410 


Arg 


Thr 


lie 


Val 


Asn 

415 


Gly 




He 


Val 


Glu 


Asp 

420 


Tyr 


Arg Pro 


Pro 


Phe 

425 


Tyr 


Asp 


Val 


Val 


Pro 

430 


Asn 


Asp 



60 
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15 



Pro 


Ser 


Phe 

435 


Glu 


Asp 


Met 


Lys 


Lys 

440 


Val 


Val 


Cys 


Val 


Asp 

445 


Gin 


Gin 


Thr 


Pro 


Thr 

450 


lie 


Pro 


Asn 


Arg 


Leu 

455 


Ala 


Ala 


Asp 


Pro 


Val 

460 


Leu 


Ser Gly 


Leu 


Ala 

465 


Gin 


Met 


Met 


Arg 


Glu 

470 


Cys 


Trp 


Tyr 


Pro 


Asn 

475 


Pro 


Ser 


Ala 


Arg 


Leu 

480 


Thr 


Ala 


Leu 


Arg 


He 

485 


Lys 


Lys 


Thr 


Leu 


Gin Lys 
490 


lie 


Ser 


Asn 


Ser 

495 


Pro 


Glu 


Lys 


Pro 


Lys 

500 


Val 


lie 


Gin 




















(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 3: 



















20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

30 (B) LOCATION: 104.. 1633 

(D) OTHER INFORMATION: /product= "Human ALK2 " 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CTCCGAGTAC CCCAGTGACC AGAGTGAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 



40 



45 



50 



55 



GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 

Met Val Asp Gly 

1 



GTG 


ATG 


ATT 


CTT 


CCT 


GTG 


CTT 


ATC 


ATG 


ATT 


GCT 


CTC 


CCC 


TCC 


CCT 


AGT 


Val 

5 


Met 


lie 


Leu 


Pro 


Val 

10 


Leu 


lie 


Met 


lie 


Ala 

15 


Leu 


Pro 


Ser 


Pro 


Ser 

20 


ATG 


GAA 


GAT 


GAG 


AAG 


CCC 


AAG 


GTC 


AAC 


CCC 


AAA 


CTC 


TAC 


ATG 


TGT 


GTG 


Met 


Glu 


Asp 


Glu 


Lys 

25 


Pro 


Lys 


Val 


Asn 


Pro 

30 


Lys 


Leu 


Tyr 


Met 


Cys 

35 


Val 


TGT 


GAA 


GGT 


CTC 


TCC 


TGC 


GGT 


AAT 


GAG 


GAC 


CAC 


TGT 


GAA 


GGC 


CAG 


CAG 


Cys 


Glu 


Gly 


Leu 

40 


Ser 


Cys 


Gly 


Asn 


Glu 

45 


Asp 


His 


Cys 


Glu 


Gly 

50 


Gin 


Gin 


TGC 


TTT 


TCC 


TCA 


CTG 


AGC 


ATC 


AAC 


GAT 


GGC 


TTC 


CAC 


GTC 


TAC 


CAG 


AAA 


Cys 


Phe 


Ser 

55 


Ser 


Leu 


Ser 


lie 


Asn 

60 


Asp 


Gly 


Phe 


His 


Val 

65 


Tyr 


Gin 


Lys 


GGC 


TGC 


TTC 


CAG 


GTT 


TAT 


GAG 


CAG 


GGA 


AAG 


ATG 


ACC 


TGT 


AAG 


ACC 


CCG 


Gly 


Cys 

70 


Phe 


Gin 


Val 


Tyr 


Glu 

75 


Gin 


Gly 


Lys 


Met 


Thr 

80 


Cys 


Lys 


Thr 


Pro 



60 

115 

163 

211 

259 

307 

355 



60 
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CCG 


TCC 


CCT 


GGC 


CAA 


GCT 


GTG 


GAG 


TGC 


TGC 


CAA 


GGG 


GAC 


TGG 


TGT 


AAC 


403 




Pro 


Ser 


Pro 


Gly Gin Ala 


Val 


Glu 


Cys 


Cys 


Gin 


Gly Asp 


Trp 


Cys 


Asn 




c 


85 










90 










95 










100 




D 


AGG 


AAC 


ATC 


ACG 


GCC 


CAG 


CTG 


CCC 


ACT 


AAA 


GGA 


AAA 


TCC 


TTC 


CCT 


GGA 


451 




Arg 


Asn 


He 


Thr 


Ala 


Gin 


Leu 


Pro 


Thr 


Lys 


Gly 


Lys 


Ser 


Phe 


Pro Gly 














105 










110 










115 






10 


ACA 


CAG 


AAT 


TTC 


CAC 


TTG 


GAG 


GTT 


GGC 


CTC 


ATT 


ATT 


CTC 


TCT 


GTA 


GTG 


499 




Thr 


Gin 


Asn 


Phe 


His 


Leu 


Glu 


Val 


Gly 


Leu 


lie 


lie 


Leu 


Ser 


Val 


Val 












120 










125 










130 










TTC 


GCA 


GTA 


TGT 


CTT 


TTA 


GCC 


TGC 


CTG 


CTG 


GGA 


GTT 


GCT 


CTC 


CGA 


AAA 


547 


15 


Phe 


Ala 


Val 


Cys 


Leu 


Leu 


Ala 


Cys 


Leu 


Leu Gly 


Val 


Ala 


Leu 


Arg 


Lys 










135 










140 










145 








TTT 


AAA 


AGG 


CGC 


AAC 


CAA 


GAA 


CGC 


CTC 


AAT 


CCC 


CGA 


GAC 


GTG 


GAG 


TAT 


595 




Phe 


Lys 


Arg 


Arg 


Asn 


Gin 


Glu 


Arg 


Leu 


Asn 


Pro 


Arg 


Asp Val 


Glu 


Tyr 




20 




150 










155 










160 












GGC 


ACT 


ATC 


GAA 


GGG 


CTC 


ATC 


ACC 


ACC 


AAT 


GTT 


GGA 


GAC 


AGC 


ACT 


TTA 


643 




Gly 


Thr 


lie 


Glu 


Gly 


Leu 


lie 


Thr 


Thr 


Asn 


Val 


Gly Asp 


Ser 


Thr 


Leu 




25 


165 










170 










175 










180 




GCA 


GAT 


TTA 


TTG 


GAT 


CAT 


TCG 


TGT 


ACA 


TCA 


GGA 


AGT 


GGC 


TCT 


GGT 


CTT 


691 




Ala 


Asp 


Leu 


Leu 


Asp 


His 


Ser 


Cys 


Thr 


Ser Gly 


Ser Gly 


Ser Gly 


Leu 














185 










190 










195 






30 


CCT 


TTT 


CTG 


GTA 


CAA 


AGA 


ACA 


GTG 


GCT 


CGC 


CAG 


ATT 


ACA 


CTG 


TTG 


GAG 


739 




Pro 


Phe 


Leu 


Val 


Gin 


Arg 


Thr 


Val 


Ala 


Arg 


Gin 


lie 


Thr 


Leu 


Leu 


Glu 












200 










205 










210 










TGT 


GTC 


GGG 


AAA 


GGC 


AGG 


TAT 


GGT 


GAG 


GTG 


TGG 


AGG 


GGC 


AGC 


TGG 


CAA 


787 


35 


Cys 


Val 


Gly 


Lys 


Gly Arg 


Tyr 


Gly 


Glu 


Val 


Trp 


Arg 


Gly 


Ser 


Trp 


Gin 










215 










220 










225 












GGG 


GAA 


AAT 


GTT 


GCC 


GTG 


AAG 


ATC 


TTC 


TCC 


TCC 


CGT 


GAT 


GAG 


AAG 


TCA 


835 




Gly 


Glu 


Asn 


Val 


Ala 


Val 


Lys 


lie 


Phe 


Ser 


Ser 


Arg 


Asp 


Glu 


Lys 


Ser 




40 




230 










235 










240 














TGG 


TTC 


AGG 


GAA 


ACG 


GAA 


TTG 


TAC 


AAC 


ACT 


GTG 


ATG 


CTG 


AGG 


CAT 


GAA 


883 




Trp 


Phe 


Arg 


Glu 


Thr 


Glu 


Leu 


Tyr 


Asn 


Thr 


Val 


Met 


Leu Arg 


His 


Glu 




45 


245 










250 










255 










260 




AAT 


ATC 


TTA 


GGT 


TTC 


ATT 


GCT 


TCA 


GAC 


ATG 


ACA 


TCA 


AGA 


CAC 


TCC 


AGT 


931 




Asn 


lie 


Leu 


Gly 


Phe 


lie 


Ala 


Ser 


Asp 


Met 


Thr 


Ser 


Arg 


His 


Ser 


Ser 














265 










270 










275 






50 


ACC 


CAG 


CTG 


TGG 


TTA 


ATT 


ACA 


CAT 


TAT 


CAT 


GAA 


ATG 


GGA 


TCG 


TTG 


TAC 


979 




Thr 


Gin 


Leu 


Trp 


Leu 


lie 


Thr 


His 


Tyr 


His 


Glu 


Met Gly 


Ser 


Leu 


Tyr 












280 










285 










290 








GAC 


TAT 


CTT 


CAG 


CTT 


ACT 


ACT 


CTG 


GAT 


ACA 


GTT 


AGC 


TGC 


CTT 


CGA 


ATA 


1027 


55 


Asp 


Tyr 


Leu 


Gin 


Leu 


Thr 


Thr 


Leu 


Asp 


Thr 


Val 


Ser 


Cys 


Leu 


Arg 


lie 










295 










300 










305 












GTG 


CTG 


TCC 


ATA 


GCT 


AGT 


GGT 


CTT 


GCA 


CAT 


TTG 


CAC 


ATA 


GAG 


ATA 


TTT 


1075 




Val 


Leu 


Ser 


lie 


Ala 


Ser 


Gly 


Leu 


Ala 


His 


Leu 


His 


lie 


Glu 


lie 


Phe 




60 




310 










315 










320 
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GGG 


ACC 


CAA 


GGG 


AAA 


CCA 


GCC 


ATT 


GCC 


CAT 


CGA 


GAT 


TTA 


AAG 


AGC 


AAA 


c 


Gly 

325 


Thr 


Gin 


Gly 


Lys 


Pro 

330 


Ala 


lie 


Ala 


His 


Axg 

335 


Asp 


Leu 


Lys 


Ser 


Lys 

340 


D 


AAT 


ATT 


CTG 


GTT 


AAG 


AAG 


AAT 


GGA 


CAG 


TGT 


TGC 


ATA 


GCA 


GAT 


TTG 


GGC 




Asn 


He 


Leu 


Val 


Lys 

345 


Lys 


Asn 


Gly 


Gin 


Cys 

350 


Cys 


lie 


Ala 


Asp 


Leu 

355 


Gly 


10 


CTG 


GCA 


GTC 


ATG 


CAT. 


TCC 


CAG 


AGC 


ACC 


AAT 


CAG 


CTT 


GAT 


GTG 


GGG 


AAC 




Leu 


Ala 


Val 


Met 

360 


His 


Ser 


Gin 


Ser 


Thr 

365 


Asn 


Gin 


Leu 


Asp 


Val 

370 


Gly 


Asn 




AAT 


CCC 


CGT 


GTG 


GGC 


ACC 


AAG 


CGC 


TAC 


ATG 


GCC 


CCC 


GAA 


GTT 


CTA 


GAT 


15 


Asn 


Pro 


Arg 

375 


Val 


Gly 


Thr 


Lys 


Arg 

380 


Tyr 


Met 


Ala 


Pro 


Glu 

385 


Val 


Leu 


Asp 




GAA 


ACC 


ATC 


CAG 


GTG 


GAT 


TGT 


TTC 


GAT 


TCT 


TAT 


AAA 


AGG 


GTC 


GAT 


ATT 


20 


Glu 


Thr 

390 


lie 


Gin 


Val 


Asp 


Cys 

395 


Phe 


Asp 


Ser 


Tyr 


Lys 

400 


Arg 


Val 


Asp 


lie 




TGG 


GCC 


TTT 


GGA 


CTT 


GTT 


TTG 


TGG 


GAA 


GTG 


GCC 


AGG 


CGG 


ATG 


GTG 


AGC 


25 


Trp 

405 


Ala 


Phe 


Gly 


Leu 


Val 

410 


Leu 


Trp 


Glu 


Val 


Ala 

415 


Arg 


Arg 


Met 


Val 


Ser 

420 


AAT 


GGT 


ATA 


GTG 


GAG 


GAT 


TAC 


AAG 


CCA 


CCG 


TTC 


TAC 


GAT 


GTG 


GTT 


CCC 




Asn 


Gly 


lie 


Val 


Glu 

425 


Asp 


Tyr 


Lys 


Pro 


Pro 

430 


Phe 


Tyr 


Asp 


Val 


Val 

.435 


Pro 


30 


AAT 


GAC 


CCA 


AGT 


TTT 


GAA 


GAT 


ATG 


AGG 


AAG 


GTA 


GTC 


TGT 


GTG 


GAT 


CAA 




Asn 


Asp 


Pro. 


Ser 

440 


Phe 


Glu 


Asp 


Met 


Arg 

445 


Lys 


Val 


Val 


Cys 


Val 

450 


Asp 


Gin 




CAA 


AGG 


CCA 


AAC 


ATA 


CCC 


AAC 


AGA 


TGG 


TTC 


TCA 


GAC 


CCG 


ACA 


TTA 


ACC 


35 


Gin 


Arg 


Pro 

455 


Asn 


lie 


Pro 


Asn 


Arg 

460 


Trp 


Phe 


Ser 


Asp 


Pro 

465 


Thr 


Leu 


Thr 




TCT 


CTG 


GCC 


AAG 


CTA 


ATG 


AAA 


GAA 


TGC 


TGG 


TAT 


CAA 


AAT 


CCA 


TCC 


GCA 


40 


Ser 


Leu 

470 


Ala 


Lys 


Leu 


Met 


Lys 

475 


Glu 


Cys 


Trp 


Tyr 


Gin 

480 


Asn 


Pro 


Ser 


Ala 




AGA 


CTC 


ACA 


GCA 


CTG 


CGT 


ATC 


AAA 


AAG 


ACT 


TTG 


ACC 


AAA 


ATT 


GAT 


AAT 


45 


Arg 

485 

TCC 

Ser 


Leu 

CTC 

Leu 


Thr 

GAC 

Asp 


Ala 

AAA 

Lys 


Leu 

TTG 

Leu 

505 


Arg 

490 

AAA 

Lys 


lie 

ACT 

Thr 


Lys 

GAC 

Asp 


Lys 

TGT 

Cys 


Thr Leu Thr Lys lie Asp 
495 

TGACATTTTC ATAGTGTCAA 
510 


Asn 

500 



50 GAAGGAAGAT TTGACGTTGT TGTCATTGTC CAGCTGGGAC CTAATGCTGG CCTGACTGGT 

TGTCAGAATG GAATCCATCT GTCTCCCTCC CCAAATGGCT GCTTTGACAA GGCAGACGTC 

GTACCCAGCC ATGTGTTGGG GAGACATCAA AACCACCCTA ACCTCGCTCG ATGACTGTGA 
55 

ACTGGGCATT TCACGAACTG TTCACACTGC AGAGACTAAT GTTGGACAGA CACTGTTGCA 
AAGGTAGGGA CTGGAGGAAC ACAGAGAAAT CCTAAAAGAG ATCTGGGCAT TAAGTCAGTG 
GCTTTGCATA : GCTTTCACAA GTCTCCTAGA CACTCCCCAC GGGAAACTCA AGGAGGTGGT 



1123 

1171 

1219 

1267 

1315 

13 63 

1411 

1459 

1507 

1555 

1603 

1650 

1710 

1770 

1830 

1890 

1950 



60 



2010 
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GAATTTTTAA TCAGCAATAT TGCCTGTGCT TCTCTTCTTT ATTGCACTAG GAATTCTTTG 

CATTCCTTAC TTGCACTGTT ACTCTTAATT TTAAAGACCC AACTTGCCAA AATGTTGGCT 
5 

GCGTACTCCA CTGGTCTGTC TTTGGATAAT AGGAATTCAA TTTGGCAAAA CAAAATGTAA 

TGTCAGACTT TGCTGCATTT TACACATGTG CTGATGTTTA CAATGATGCC GAACATTAGG 

10 AATTGTTTAT ACACAACTTT GCAAATTATT TATTACTTGT GCACTTAGTA GTTTTTACAA 

AACTGCTTTG TGCATATGTT AAAGCTTATT TTTATGTGGT CTTATGATTT TATTACAGAA 

ATGTTTTTAA CACTATACTC TAAAATGGAC ATTTTCTTTT ATTATCAGTT AAAATCACAT 
15 

TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAGTT CATATGCAGA 

ACGTATTTAG CCATTACCCA CGTGACACCA CCGAATATAT TATCGATTTA GAAGCAAAGA 

20 TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 

ATTACGTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 

TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 
25 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
35 







(xi) ; 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 


4: 










40 


Met 

1 


Val 


Asp 


Gly Val 
5 


Met 


lie Leu 


Pro 


Val 

10 


Leu 


lie 


Met 


lie 


Ala 

15 


Leu 




Pro 


Ser 


Pro 


Ser Met 
20 


Glu 


Asp Glu 


Lys 

25 


Pro 


Lys 


Val 


Asn 


Pro 

30 


Lys 


Leu 


45 


Tyr 


Met 


Cys 

35 


Val Cys 


Glu 


Gly Leu 
40 


Ser 


Cys 


Gly Asn 


Glu 

45 


Asp His 


Cys 




Glu 


Gly 

50 


Gin 


Gin Cys 


Phe 


Ser Ser 
55 


Leu 


Ser 


He 


Asn 

60 


Asp 


Gly 


Phe 


His 


50 


Val 

65 


Tyr 


Gin Lys Gly 


Cys 

70 


Phe Gin 


Val 


Tyr 


Glu 

75 


Gin 


Gly 


Lys 


Met 


Thr 

80 


55 


Cys 


Lys 


Thr 


Pro Pro 
85 


Ser 


Pro Gly Gin 


Ala 

90 


Val 


Glu 


Cys 


Cys Gin Gly 
95 




Asp 


Trp 


Cys 


Asn Arg 
100 


Asn 


lie Thr 


Ala 

105 


Gin 


Leu 


Pro 


Thr 


Lys 

.110 


Gly Lys 


60 


Ser 


Phe 


Pro 

115 


Gly Thr 


Gin 


Asn Phe 
120 


His 


Leu 


Glu 


Val 


Gly 

125 


Leu 


lie 


lie 



2070 

2130 

2190 

2250 

2310 

2370 

2430 

2490 

2550 

2610 

2670 

2724 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



Leu 


Ser 

130 


Val 


Val 


Phe Ala 


Val 

135 


Cys 


Leu 


Leu 


Ala Cys 
140 


Leu 


Leu 


Gly 


Val 


Ala 

145 


Leu 


Arg 


Lys 


Phe Lys 
150 


Arg 


Arg 


Asn 


Gin 


Glu Arg 
155 


Leu 


Asn 


Pro 


Arg 

160 


Asp 


Val 


Glu 


Tyr 


Gly Thr 
165 


lie 


GlU 


Gly 


Leu 

170 


lie Thr 


Thr 


Asn 


Val 

175 


Gly 


Asp 


Ser 


Thr 


Leu 

180 


Ala Asp 


Leu 


Leu 


Asp 

185 


His 


Ser Cys 


Thr 


Ser 

190 


Gly 


Ser 


Gly 


Ser 


Gly 

195 


Leu 


Pro Phe 


Leu 


Val 

200 


Gin 


Arg 


Thr Val 


Ala 

205 


Arg 


Gin 


lie 


Thr 


Leu 

210 


Leu 


Glu 


Cys Val 


Gly 

215 


Lys 


Gly 


Arg 


Tyr Gly 
220 


Glu 


Val 


Trp 


Arg 


Gly 

225 


Ser 


Trp 


Gin 


Gly Glu 
230 


Asn 


Val 


Ala 


Val 


Lys lie 
235 


Phe 


Ser 


Ser 


Arg 

240 


Asp 


Glu 


Lys 


Ser 


Trp Phe 
245 


Arg 


Glu 


Thr 


Glu 

250 


Leu Tyr 


Asn 


Thr 


Val 

255 


Met 


Leu 


Arg 


His 


Glu 

260 


Asn lie 


Leu Gly 


Phe 

265 


lie 


Ala Ser 


Asp 


Met 

270 


Thr 


Ser 


Arg 


His 


Ser 

275 


Ser 


Thr Gin 


Leu 


Trp 

280 


Leu 


lie 


Thr His 


Tyr 

285 


His 


Glu 


Met 


Gly 


Ser 

290 


Leu 


Tyr 


Asp Tyr 


Leu 

295 


Gin 


Leu 


Thr 


Thr Leu 
300 


Asp 


Thr 


Val 


Ser 


Cys 

305 


Leu 


Arg 


lie 


Val Leu 
310 


Ser 


lie 


Ala 


Ser 


Gly Leu 
315 


Ala 


His 


Leu 


His 

320 


He 


Glu 


lie 


Phe 


Gly Thr 
325 


Gin Gly 


Lys 


Pro 

330 


Ala lie 


Ala 


His 


Arg 

335 


Asp 


Leu 


Lys 


Ser 


Lys 

340 


Asn He 


Leu 


Val 


Lys 

345 


Lys 


Asn Gly 


Gin 


Cys 

350 


Cys 


lie 


Ala 


Asp 


Leu 

355 


Gly Leu Ala 


Val 


Met 

360 


His 


Ser 


Gin Ser 


Thr 

365 


Asn 


Gin 


Leu 


Asp 


Val 

370 


Gly 


Asn 


Asn Pro 


Arg 

375 


Val 


Gly 


Thr 


Lys Arg 
380 


Tyr 


Met 


Ala 


Pro 


Glu 

385 


Val 


Leu 


Asp 


Glu Thr 
390 


lie 


Gin 


Val 


Asp 


Cys Phe 
395 


Asp 


Ser 


Tyr 


Lys 

400 


Arg 


Val 


Asp 


lie 


Trp Ala 
405 


Phe Gly 


Leu 


Val 

410 


Leu Trp 


Glu 


Vai 


Ala 

415 


Arg 


Arg 


Met 


Val 


Ser 

420 


Asn Gly 


lie 


Val 


Glu 

425 


Asp 


Tyr Lys 


Pro 


Pro 

430 


Phe 


Tyr 


Asp 


Val 


Val 

435 


Pro 


Asn Asp 


Pro 


Ser 

440 


Phe 


Glu 


Asp Met 


Arg 

445 


Lys 


Val 


.Val 



60 
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Cys Val Asp Gin Gin Arg Pro Asn lie Pro Asn Arg Trp Phe Ser Asp 
450 455 460 

5 Pro Thr Leu Thr Ser Leu Ala Lys Leu Met Lys Glu Cys Trp Tyr Gin 
465 470 475 480 



10 



Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg lie Lys Lys Thr Leu Thr 
485 490 495 

Lys lie Asp Asn Ser Leu Asp Lys Leu Lys Thr Asp Cys 
500 505 



IS (2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2932 base pairs 

(B) TYPE: nuclei c acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
25 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 310.. 1908 

(D) OTHER INFORMATION: /product= "Human ALK3 " 
30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GCTCCGCGCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 60 

35 

CAGTTTAATA CTGTCTTGGA ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTTGGAGAA 120 

AATCAGAAGT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180 

40 TTGAAGTCAT . TGTCAAGTGC TTGCGATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 240 

TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAAGGTGA CAGTACACAG GAAACATTAC 300 

AATTGAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTG GGA GCC 348 

45 Met Thr Gin Leu Tyr lie Tyr He Arg Leu Leu Gly Ala 

1 5 10 

TAT TTG TTC ATC ATT TCT CGT GTT CAA GGA CAG AAT CTG GAT AGT ATG 396 

Tyr Leu Phe lie lie Ser Arg Val Gin Gly Gin Asn Leu Asp Ser Met 
50 15 20 25 



CTT CAT GGC ACT GGG ATG AAA TCA GAC TCC GAC CAG AAA AAG TCA GAA 444 

Leu His Gly Thr Gly Met Lys Ser Asp Ser Asp Gin Lys Lys Ser Glu 

30 35 40 45 

AAT GGA GTA ACC TTA GCA CCA GAG GAT ACC TTG CCT TTT TTA AAG TGC 492 

Asn Gly Val Thr Leu Ala Pro Glu Asp Thr Leu Pro Phe Leu Lys Cys 

50 55 60 



60 



TAT TGC TCA GGG CAC TGT CCA GAT GAT GCT ATT AAT AAC ACA TGC ATA 



540 
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Tyr 


Cys 


Ser 


Gly His 
65 


Cys 


Pro 


Asp 




ACT 


AAT 


GGA 


CAT 


TGC 


TTT 


GCC 


ATC 


5 


Thr 


Asn 


Gly 

80 


His 


Cys 


Phe 


Ala 


lie 

85 




ACC 


ACA 


TTA 


GCT 


TCA 


GGG 


TGT 


ATG 


10 


Thr 


Thr 

95 


Leu 


Ala 


Ser 


Gly 


Cys 

100 


Met 




TGC 


AAA 


GAT 


TCT 


CCA 


AAA 


GCC 


CAG 


15 


Cys 

110 


Lys 


Asp 


Ser 


Pro 


Lys 

115 


Ala 


Gin 




CGG 


ACC 


AAT 


TTA 


TGT 


AAC 


CAG 


TAT 




Arg 


Thr 


Asn 


Leu 


Cys 

130 


Asn 


Gin 


Tyr 


20 


GTC 


ATA 


GGT 


CCG 


TTT 


TTT 


GAT 


GGC 




Val 


lie 


Gly 


Pro 

145 


Phe 


Phe 


Asp 


Gly 




ATT 


TCT 


ATG 


GCT 


GTC 


TGC 


ATA 


ATT 


25 


lie 


Ser 


Met 

160 


Ala 


Val 


Cys 


lie 


lie 

165 




TTT 


TGT 


TAC 


AAA 


CAT 


TAT 


TGC 


AAG 


30 


Phe 


Cys 

175 


Tyr 


Lys 


His 


Tyr 


Cys 

180 


Lys 




AAT 


CGT 


GAT 


TTG 


GAA 


CAG 


GAT 


GAA 


35 


Asn 

190 


Arg 


Asp 


Leu 


Glu 


Gin 

195 


Asp 


Glu 




CTA 


AAA 


GAC 


CTT 


ATT 


GAC 


CAG 


TCA 




Leu 


Lys 


Asp 


Leu 


lie 

210 


Asp 


Gin 


Ser 


40 


CTA 


CCT 


TTA 


TTG 


GTT 


CAG 


CGA 


ACT 




Leu 


Pro 


Leu 


Leu 

225 


Val 


Gin 


Arg 


Thr 




CGG 


CAA 


GTT 


GGT 


AAA 


GGC 


CGA 


TAT 


45 


Arg 


Gin 


Val 

240 


Gly Lys 


Gly 


Arg 


Tyr 

245 




CGT 


GGC 


GAA 


AAA 


GTG 


GCG 


GTG 


AAA 


50 


Arg 


Gly 

255 


Glu 


Lys 


Val 


Ala 


Val 

260 


Lys 




AGC 


TGG 


TTT 


CGA 


GAA 


ACA 


GAA 


ATC 


55 


Ser 

270 


Trp 


Phe 


Arg 


Glu 


Thr 

275 


Glu 


lie 




GAA 


AAC 


ATA 


CTT 


GGT 


TTC 


ATA 


GCG 




Glu 


Asn 


He 


Leu 


Gly 

290 


Phe 


lie 


Ala 


60 


TGG 


ACT 


CAG 


CTC 


TAT 


TTG 


ATT 


ACT 
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Asp 

70 


Ala 


lie 


Asn 


Asn 


Thr 

75 


Cys 


lie 




ATA 


GAA 


GAA 


GAT 


GAC 


CAG 


GGA 


GAA 


588 


lie 


Glu 


Glu 


Asp 


Asp 

90 


Gin 


Gly 


Glu 




AAA 


TAT 


GAA 


GGA 


TCT 


GAT 


TTT 


CAG 


636 


Lys 


Tyr 


Glu 


Gly 

105 


Ser 


Asp 


Phe 


Gin 




CTA 


CGC 


CGG 


ACA 


ATA 


GAA 


TGT 


TGT 


684 


Leu 


Arg 


Arg 

120 


Thr 


lie 


Glu 


Cys 


Cys 

125 




TTG 


CAA 


CCC 


ACA 


CTG 


CCC 


CCT 


GTT 


732 


Leu 


Gin 

135 


Pro 


Thr 


Leu 


Pro 


Pro 

140 


Val 




AGC 


ATT 


CGA 


TGG 


CTG 


GTT 


TTG 


CTC 


780 


Ser 

150 


lie 


Arg 


Trp 


Leu 


Val 

155 


Leu 


Leu 




GCT 


ATG 


ATC 


ATC 


TTC 


TCC 


AGC 


TGC 


828 


Ala 


Met 


lie 


lie 


Phe 

170 


Ser 


Ser 


Cys 




AGC 


ATC 


TCA 


AGC 


AGA 


CGT 


CGT 


TAC 


876 


Ser 


lie 


Ser 


Ser 

185 


Arg 


Arg 


Arg 


Tyr 




GCA 


TTT 


ATT 


CCA 


GTT 


GGA 


GAA 


TCA 


924 


Ala 


Phe 


lie 

200 


Pro 


Val 


Gly 


Glu 


Ser 

205 




CAA 


AGT 


TCT 


GGT 


AGT 


GGG 


TCT 


GGA 


972 


Gin 


Ser 

215 


Ser 


Gly 


Ser 


Gly 


Ser 

220 


Gly 




ATT- 


GCC 


AAA 


CAG 


ATT 


CAG 


ATG 


GTC * 


1020 


lie 

230 


Ala 


Lys 


Gin 


lie 


Gin 

235 


Met 


Val 




GGA 


GAA 


GTA 


TGG 


ATG 


GGC 


AAA 


TGG 


1068 


Gly 


Glu 


Val 


Trp 


Met 

250 


Gly 


Lys 


Trp 




GTA 


TTC 


TTT 


ACC 


ACT 


GAA 


GAA 


GCC 


1116 


Val 


Phe 


Phe 


Thr 

265 


Thr 


Glu 


Glu 


Ala 




TAC 


CAA 


ACT 


GTG 


CTA 


ATG 


CGC 


CAT 


1164 


Tyr 


Gin 


Thr 

280 


Val 


Leu 


Met 


Arg 


His 

285 




GCA 


GAC 


ATT 


AAA 


GGT 


ACA 


GGT 


TCC 


1212 


Ala 


Asp 

295 


lie 


Lys 


Gly 


Thr 


Gly 

300 


Ser 




GAT 


TAC 


CAT 


GAA 


AAT 


GGA 


TCT 


CTC 


1260 
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Trp 


Thr 


Gin 


Leu 


Tyr 


Leu 


lie 


Thr 


Asp 


Tyr 


His 


Glu 


Asn 


Gly Ser 


Leu 












305 










310 










315 










TAT 


GAC 


TTC 


CTG 


AAA 


TGT 


GCT 


ACA 


CTG 


GAC 


ACC 


AGA 


GCC 


CTG 


CTT 


AAA 


1308 


5 


Tyr 


Asp 


Phe 


Leu 


Lys 


Cys 


Ala 


Thr 


Leu 


Asp 


Thr 


Arg 


Ala 


Leu 


Leu 


Lys 










320 










325 










330 










TTG 


GCT 


TAT 


TCA 


GCT 


GCC 


TGT 


GGT 


CTG 


TGC 


CAC 


CTG 


CAC 


ACA 


GAA 


ATT 


1356 




Leu 


Ala 


Tyr 


Ser 


Ala 


Ala 


Cys 


Gly 


Leu 


Cys 


His 


Leu 


His 


Thr 


Glu 


lie 




10 




335 










340 










345 














TAT 


GGC 


ACC 


CAA 


GGA 


AAG 


ccc 


GCA 


ATT 


GCT 


CAT 


CGA 


GAC 


CTA 


AAG 


AGC 


1404 




Tyr Gly 


Thr 


Gin 


Gly Lys 


Pro 


Ala 


lie 


Ala 


His 


Arg Asp 


Leu 


Lys 


Ser 




15 


350 










355 










360 








365 




AAA 


AAC 


ATC 


CTC 


ATC 


AAG 


AAA 


AAT 


GGG 


AGT 


TGC 


TGC 


ATT 


GCT 


GAC 


CTG 


1452 




Lys 


Asn 


lie 


Leu 


He 


Lys 


Lys 


Asn 


Gly Ser 


Cys 


Cys 


lie 


Ala 


Asp 


Leu 














370 










375 










380 






20 


GGC 


CTT 


GCT 


GTT 


AAA 


TTC 


AAC 


AGT 


GAC 


ACA 


AAT 


GAA 


GTT 


GAT 


GTG 


CCC 


1500 




Gly 


Leu 


Ala 


Val 


Lys 


Phe 


Asn 


Ser 


Asp 


Thr 


Asn 


Glu 


Val 


Asp Val 


Pro 












385 










390 










395 










TTG 


AAT 


ACC 


AGG 


GTG 


GGC 


ACC 


,AAA 


CGC 


TAC 


ATG 


GCT 


CCC 


GAA 


GTG 


CTG 


1548 


25 


Leu 


Asn 


Thr 


Arg 


Val 


Gly 


Thr 


Lys 


Arg 


Tyr 


Met 


Ala 


Pro 


Glu 


Val 


Leu 










400 










405 










410 












GAC 


GAA 


AGC 


CTG 


AAC 


AAA 


AAC 


CAC 


TTC 


CAG 


CCC 


TAC 


ATC 


ATG GCT 


GAC 


1596 




Asp 


Glu 


Ser 


Leu 


Asn 


Lys 


Asn 


His 


Phe 


Gin 


Pro 


Tyr 


lie 


Met 


Ala 


Asp 




30 




415 










420 










425 












ATC 


TAC 


AGC 


TTC 


GGC 


CTA 


ATC 


ATT 


TGG 


GAG 


ATG 


GCT 


CGT 


CGT 


TGT 


ATC 


1644 




He 


Tyr 


Ser 


Phe 


Gly Leu 


lie 


lie 


Trp 


Glu 


Met 


Ala 


Arg 


Arg 


Cys 


lie 




35 


430 










435 










440 








445 




ACA 


GGA 


GGG 


ATC 


GTG 


GAA 


GAA 


TAC 


CAA 


TTG 


CCA 


TAT 


TAC 


AAC 


ATG 


GTA 


1692 




Thr Gly 


Gly 


lie 


Val 


Glu 


Glu 


Tyr 


Gin 


Leu 


Pro 


Tyr 


Tyr 


Asn 


Met 


Val 














450 










455 










460 






40 


CCG 


AGT 


GAT 


CCG 


TCA 


TAC 


GAA 


GAT 


ATG 


CGT 


GAG 


GTT 


GTG 


TGT 


GTC 


AAA 


1740 




Pro 


Ser 


Asp 


Pro 


Ser 


Tyr 


Glu 


Asp 


Met 


Arg 


Glu 


Val 


Val 


Cys 


Val 


Lys 












465 










470 










475 








CGT 


TTG 


CGG 


CCA 


ATT 


GTG 


TCT 


AAT 


CGG 


TGG 


AAC 


AGT 


GAT 


GAA 


TGT 


CTA 


1788 


45 


Arg 


Leu 


Arg 


Pro 


lie 


Val 


Ser 


Asn 


Arg 


Trp Asn 


Ser Asp 


Glu 


Cys 


Leu 










480 










485 










490 












CGA 


GCA 


GTT 


TTG 


AAG 


CTA 


ATG 


TCA 


GAA 


TGC 


TGG 


GCC 


CAC 


AAT 


CCA 


GCC 


1836 




Arg Ala 


Val 


Leu 


Lys 


Leu 


Met 


Ser 


Glu 


Cys 


Trp Ala 


His 


Asn 


Pro 


Ala 




50 




495 










500 










505 














TCC 


AGA 


CTC 


ACA 


GCA 


TTG 


AGA 


ATT 


AAG 


AAG 


ACG 


CTT 


GCC 


AAG 


ATG 


GTT 


1884 




Ser 


Arg 


Leu 


Thr 


Ala 


Leu 


Arg 


lie 


Lys 


Lys 


Thr 


Leu 


Ala 


Lys 


Met 


Val 




55 


510 










515 










520 










525 




GAA 


TCC 


CAA 


GAT 


GTA 


AAA 


ATC 


TGATGGTTAA ACCATCGGAG GAGAAACTCT 


1935 




Glu 


Ser 


Gin 


Asp 


Val 


Lys 


lie 























530 



60 



AGACTGCAAG AACTGTTTTT ACCCATGGCA TGGGTGGAAT TAGAGTGGAA TAAGGATGTT 1995 
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AACTTGGTTC TCAGACTCTT TCTTCACTAC GTGTTCACAG GCTGCTAATA TTAAACCTTT 

CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 
5 

CAGCTTTATT TTAAATGTGG TTTTTGATGC CTTTTTTTAA GTGGGTTTTT ATGAACTGCA 

TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT GGGTACTGAA TTGCCTGTTC 

10 ATAAAACGGT GCTTTCTGTG AAAGCCTTAA GAAGATAAAT GAGCGCAGCA GAGATGGAGA 

AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCGTTTG TATTCTACCT TTGTAAAACA 

GCC TATAGAT GATGATGTGT TTGGGATACT GCTTATTTTA TGATAGTTTG TCCTGTGTCC 
15 

TTAGTGATGT GTGTGTGTCT CCATGCACAT GCACGCCGGG ATTCCTCTGC TGCCATTTGA 

ATTAGAAGAA AATAATTTAT ATGCATGCAC AGGAAGATAT TGGTGGCCGG TGGTTTTGTG 

20 CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGCA 

AGTGAGATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 

AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 
25 

TTATTATTTT TGTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 
ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATGTAAG 
30 TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 
TATTTTGTGT ATAATGTGCT TTATTTGCAA ATCACCC 



35 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 532 amino acids 

(B) TYPE: amino acid 

40 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 

1 


Thr 


Gin 


Leu 


Tyr 

5 


lie 


Tyr 


lie 


Arg 


Leu 

10 


Leu Gly 


Ala 


Tyr 


Leu Phe 
15 


He 


He 


Ser 


Arg 

20 


Val 


Gin Gly Gin 


Asn 

25 


Leu 


Asp 


Ser 


Met 


Leu 

30 


His Gly 


Thr 


Gly Met 
35 


Lys 


Ser 


Asp 


Ser 


Asp 

40 


Gin 


Lys 


Lys 


Ser 


Glu 

45 


Asn 


Gly Val 


Thr 


Leu 

50 


Ala 


Pro 


Glu Asp 


Thr 

55 


Leu 


Pro 


Phe 


Leu 


Lys 

60 


Cys 


Tyr 


Cys Ser 


Gly 

65 


His 


Cys 


Pro 


Asp 


Asp 

70 


Ala 


He 


Asn 


Asn 


Thr 

75 


Cys 


lie 


Thr 


Asn Gly 
80 



2055 

2115 

2175 

2235 

2295 

2355 

2415 

2475 

2535 

2595 

2655 

2715 

2775 

2835 

2895 

2932 



60 




WO 95/30003 



PCT/US95/05467 



- 72 - 



His Cys Phe Ala lie lie Glu 
85 

Ala Ser Gly Cys Met Lys Tyr 
5 100 

Ser Pro Lys Ala Gin Leu Arg 
115 



10 Leu Cys Asn Gin Tyr Leu Gin 
130 135 

Pro Phe Phe Asp Gly Ser lie 
145 150 

15 

Ala Val Cys lie lie Ala Met 
165 

Lys His Tyr Cys Lys Ser He 
20 180 

Leu Glu Gin Asp Glu Ala Phe 
195 

25 Leu lie Asp Gin Ser Gin Ser 
210 215 

Leu Val Gin Arg Thr lie Ala 
225 230 

30 

Gly Lys Gly Arg Tyr Gly Glu 
245 

Lys Val Ala Val Lys Val Phe 
35 260 

Arg Glu Thr Glu lie Tyr Gin 
275 

40 Leu Gly Phe lie Ala Ala Asp 
290 295 



Leu Tyr Leu lie Thr Asp Tyr 
305 310 

45 

Leu Lys Cys Ala Thr Leu Asp 
325 

Ser Ala Ala Cys Gly Leu Cys 
50 340 

Gin Gly Lys Pro Ala lie Ala 
355 



55 Leu lie Lys Lys Asn Gly Ser 
370 375 

Val Lys Phe Asn Ser Asp Thr 
385 390 



Glu Asp Asp Gin Gly Glu Thr Thr Leu 
90 95 

Glu Gly Ser Asp Phe Gin Cys Lys Asp 
105 110 

Arg Thr lie Glu Cys Cys Arg Thr Asn 
120 125 

Pro Thr Leu Pro Pro Val Val lie Gly 
140 

Arg Trp Leu Val Leu Leu lie Ser Met 
155 160 

lie lie Phe Ser Ser Cys Phe Cys Tyr 
170 175 

Ser Ser Arg Arg Arg Tyr Asn Arg Asp 
185 190 

lie Pro Val Gly Glu Ser Leu Lys Asp 
200 205 

Ser Gly Ser Gly Ser Gly Leu Pro Leu 
220 

Lys Gin lie Gin Met Val Arg Gin Val 
235 240 

Val Trp Met Gly Lys Trp Arg Gly Glu 
250 255 

Phe Thr Thr Glu Glu Ala Ser Trp Phe 
265 270 

Thr Val Leu Met Arg His Glu Asn lie 
280 285 

lie Lys Gly Thr Gly Ser Trp Thr Gin 
300 

His Glu Asn Gly Ser Leu Tyr Asp Phe 
315 320 

Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
330 335 

His Leu His Thr Glu lie Tyr Gly Thr 
345 350 

His Arg Asp Leu Lys Ser Lys Asn lie 
360 365 

Cys Cys lie Ala Asp Leu Gly Leu Ala 
380 

Asn Glu Val Asp Val Pro Leu Asn Thr 
395 400 



60 




WO 95/30003 



PCT/US95/05467 



- 73 - 





Arg 


Val 


Gly 


Thr 


Lys 

405 


Arg 


Tyr 


Met 


Ala 


Pro 

410 


Glu 


Val 


Leu 


Asp 


Glu 

415 


Ser 


5 


Leu 


Asn 


Lys 


Asn 

420 


His 


Phe 


Gin 


Pro 


Tyr 

425 


lie 


Met 


Ala 


Asp 


lie 

430 


Tyr 


Ser 




Phe Gly 


Leu 

435 


lie 


lie 


Trp 


Glu 


Met 

440 


Ala 


Arg 


Arg Cys 


lie 

445 


Thr Gly Gly 


10 


lie 


Val 

450 


Glu 


Glu 


Tyr 


Gin 


Leu 

455 


Pro 


Tyr Tyr Asn 


Met 

460 


Val 


Pro 


Ser 


Asp 


15 


Pro 

465 


Ser 


Tyr 


Glu 


Asp 


Met 

470 


Arg 


Glu 


Val 


Val 


Cys 

475 


Val 


Lys 


Arg 


Leu Arg 
480 




Pro 


He 


Val 


Ser 


Asn 

485 


Arg 


Trp 


Asn 


Ser Asp 
490 


Glu 


Cys 


Leu 


Arg 


Ala 

495 


Val 


20 


Leu 


Lys 


Leu 


Met 

500 


Ser 


Glu 


Cys 


Trp 


Ala 

505 


His 


Asn 


Pro 


Ala 


Ser 

510 


Arg 


Leu 




Thr 


Ala 


Leu 

515 


Arg 


lie 


Lys 


Lys 


Thr 

520 


Leu 


Ala 


Lys 


Met 


Val 

525 


Glu 


Ser 


Gin 



25 Asp Val Lys lie 
530 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1952 base pairs 
{ B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



40 



45 



50 



55 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 187.. 1695 

(D) OTHER INFORMATION: /product = “Murine ALK6 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AAGCGGCGGC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCGGGACC 

TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT GAGCTATGAC 

AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGC CATA AGTGAAGAGA GAAGTTTATT 

GATAAC ATG CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC.ACC AAG 
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys 
1 5 10 

AAG GAG GAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys lie Leu 
15 20 25 30 



60 

120 

180 

228 

276 



60 
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CGT 


TGT 


AAA 


TGC 


CAC 


CAC 


CAC 


TGT 


CCG 


GAA GAC 


TCA 


GTC 


AAC 


AAT 


ATC 


c 


Arg 


Cys 


Lys 


Cys 


His 

35 


His 


His 


Cys 


Pro 


Glu Asp 
40 


Ser 


Val 


Asn 


Asn 

45 


lie 


3 


TGC 


AGC 


ACA 


GAT 


GGG 


TAC 


TGC 


TTC 


ACG 


ATG ATA 


GAA 


GAA 


GAT 


GAC 


TCT 




Cys 


Ser 


Thr 


Asp 

50 


Gly 


Tyr 


Cys 


Phe 


Thr 

55 


Met lie 


Glu 


Glu Asp 
60 


Asp 


Ser 


10 


GGA 


ATG 


CCT 


GTT 


GTC 


ACC 


TCT 


GGA 


TGT 


CTA GGA 


CTA 


GAA 


GGG 


TCA 


GAT 




Gly 


Met 


Pro 

65 


Val 


Val 


Thr 


Ser 


Gly Cys 
70 


Leu Gly 


Leu 


Glu Gly 
75 


Ser 


Asp 




TTT 


CAA 


TGT 


CGT 


GAC 


ACT 


CCC 


ATT 


CCT 


CAT CAA 


AGA 


AGA 


TCA 


ATT 


GAA 


15 


Phe 


Gin 

80 


Cys 


Arg 


Asp 


Thr 


Pro 

85 


lie 


Pro 


His Gin Arg 
90 


Arg 


Ser 


lie 


Glu 




TGC 


TGC 


ACA 


GAA 


AGG 


AAT 


GAG 


TGT 


AAT 


AAA GAC 


CTC 


CAC 


CCC 


ACT 


CTG 


20 


Cys 

95 


Cys 


Thr 


Glu 


Arg 


Asn 

100 


Glu 


Cys 


Asn 


Lys Asp 
105 


Leu 


His 


Pro 


Thr 


Leu 

110 




CCT 


CCT 


CTC 


AAG 


GAC 


AGA 


GAT 


TTT 


GTT 


GAT GGG 


CCC 


ATA 


CAC 


CAC 


AAG 


25 


Pro 


Pro 


Leu 


Lys 


Asp 

115 


Arg 


Asp 


Phe 


Val 


Asp Gly 
120 


Pro 


lie 


His 


His 

125 


Lys 


GCC 


TTG 


CTT 


ATC 


TCT 


GTG 


ACT 


GTC 


TGT 


AGT TTA 


CTC 


TTG 


GTC 


CTC 


ATT 




Ala 


Leu 


Leu 


lie 

130 


Ser 


Val 


Thr 


Val 


Cys 

135 


Ser Leu 


Leu 


Leu 


Val 

140 


Leu 


lie 


30 


ATT 


TTA 


TTC 


TGT 


TAC 


TTC 


AGG 


TAT 


AAA 


AGA CAA 


GAA 


GCC 


CGA 


CCT 


CGG 




lie 


Leu 


Phe 

145 


Cys 


Tyr 


Phe 


Arg 


Tyr 

150 


Lys 


Arg Gin Glu 


Ala Arg 
155 


Pro 


Arg 




TAC 


AGC 


ATT 


GGG 


CTG 


GAG 


CAG 


GAC 


GAG 


ACA TAC 


ATT 


CCT 


CCT 


GGA 


GAG 


35 


Tyr 


Ser 

160 


He 


Gly 


Leu 


Glu 


Gin 

165 


Asp 


Glu 


Thr Tyr 


lie 

170 


Pro 


Pro 


Gly 


Glu 




TCC 


CTG 


AGA 


GAC 


TTG 


ATC 


GAG 


CAG 


TCT 


CAG AGC 


TCG 


GGA 


AGT 


GGA 


TCA 


40 


Ser 

175 


Leu Arg 


Asp 


Leu 


lie 

180 


Glu 


Gin 


Ser 


Gin Ser 
185 


Ser 


Gly Ser 


Gly 


Ser 

190 




GGC 


CTC 


CCT 


CTG 


CTG 


GTC 


CAA 


AGG 


ACA 


ATA GCT 


AAG 


CAA 


ATT 


CAG 


ATG 


45 


Gly 


Leu 


Pro 


Leu 


Leu 

195 


Val 


Gin 


Arg 


Thr 


lie Ala 
200 


Lys 


Gin 


lie 


Gin 

205 


Met 


GTG 


AAG 


CAG 


ATT 


GGA 


AAA 


GGC 


CGC 


TAT 


GGC GAG 


GTG 


TGG 


ATG 


GGA 


AAG 




Val 


Lys 


Gin 


lie 

210 


Gly 


Lys 


Gly 


Arg 


Tyr 

215 


Gly Glu 


Val 


Trp 


Met 

220 


Gly 


Lys 


50 


TGG 


CGT 


GGA 


GAA 


AAG 


GTG 


GCT 


GTG 


AAA 


GTG TTC 


TTC 


ACC 


ACG 


GAG 


GAA 




Trp 


Arg Gly 
225 


Glu 


Lys 


Val 


Ala 


Val 

230 


Lys 


Val Phe 


Phe 


Thr 

235 


Thr 


Glu 


Glu 




GCC 


AGC 


TGG 


TTC 


CGA 


GAG 


ACT 


GAG 


ATA 


TAT CAG 


ACG 


GTC 


CTG 


ATG 


CGG 


55 


Ala 


Ser 

240 


Trp 


Phe 


Arg 


Glu 


Thr 

245 


Glu 


lie 


Tyr Gin 


Thr 

250 


Val 


Leu 


Met 


Arg 




CAT 


GAG 


AAT 


ATT 


CTG 


GGG 


TTC 


ATT 


GCT 


GCA GAT 


ATC 


AAA 


GGG 


ACT 


GGG 




His 


Glu 


Asn 


lie 


Leu 


Gly 


Phe 


lie 


Ala 


Ala Asp 


lie 


Lys Gly 


Thr 


Gly 



255 



260 



265 



270 



324 

372 

420 

468 

516 

564 

612 

660 

708 

756 

804 

852 

900 

948 

996 



60 
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TCC 


TGG 


ACT 


CAG 


TTG 


TAC 


CTC 


ATC 


ACA 


GAC 


TAT 


CAT 


GAA 


AAC 


GGC 


TCC 


1044 




Ser 


Trp Thr 


Gin 


Leu 


Tyr 


Leu 


lie 


Thr 


Asp 


Tyr 


His 


Glu 


Asn 


Gly 


Ser 




5 










275 










280 










285 








CTT 


TAT 


GAC 


TAT 


CTG 


AAA 


TCC 


ACC 


ACC 


TTA 


GAC 


GCA 


AAG 


TCC 


ATG 


CTG 


1092 




Leu 


Tyr Asp 


Tyr 


Leu 


Lys 


Ser 


Thr 


Thr 


Leu 


Asp 


Ala 


Lys 


Ser 


Met 


Leu 












290 










295 










300 








10 


AAG 


CTA 


GCC 


TAC 


TCC 


TCT 


GTC 


AGC 


GGC 


CTA 


TGC 


CAT 


TTA 


CAC 


ACG 


GAA 


1140 




Lys 


Leu 


Ala 


Tyr 


Ser 


Ser 


Val 


Ser 


Gly 


Leu 


Cys 


His 


Leu 


His 


Thr 


Glu 










305 










310 










315 












ATC 


TTT 


AGC 


ACT 


CAA 


GGC 


AAG 


CCA 


GCA 


ATC 


GCC 


CAT 


CGA 


GAC 


TTG 


AAA 


1188 


15 


He 


Phe 


Ser 


Thr 


Gin 


Gly 


Lys 


Pro 


Ala 


lie 


Ala 


His 


Arg 


Asp 


Leu 


Lys 








320 










325 










330 










AGT 


AAA 


AAC 


ATC 


CTG 


GTG 


AAG 


AAA 


AAT 


GGA 


ACT 


TGC 


TGC 


ATA 


GCA 


GAC 


1236 




Ser 


Lys 


Asn 


lie 


Leu 


Val 


Lys 


Lys 


Asn 


Gly 


Thr 


Cys 


Cys 


lie 


Ala 


Asp 




20 


335 










340 










345 








350 






CTG 


GGC 


TTG 


GCT 


GTC 


AAG 


TTC 


ATT 


AGT 


GAC 


ACA 


AAT 


GAG 


GTT 


GAC 


ATC 


1284 




Leu 


Gly Leu 


Ala 


Val 


Lys 


Phe 


lie 


Ser 


Asp 


Thr 


Asn 


Glu 


Val 


Asp 


lie 




25 










355 










360 










365 






CCA 


CCC 


AAC 


ACC 


CGG 


GTT 


GGC 


ACC 


AAG 


CGC 


TAT 


ATG 


CCT 


CCA 


GAA 


GTG 


1332 




Pro 


Pro 


Asn 


Thr 


Arg 


Val 


Gly 


Thr 


Lys 


Arg 


Tyr 


Met 


Pro 


Pro 


Glu 


Val 












370 










375 










380 








30 


CTG 


GAC 


GAG 


AGC 


TTG 


AAT 


AGA 


AAC 


CAT 


TTC 


CAG 


TCC 


TAC 


ATT 


ATG 


GCT 


1380 




Leu 


Asp 


Glu 


Ser 


Leu 


Asn 


Arg 


Asn 


His 


Phe 


Gin 


Ser 


Tyr 


lie 


Met 


Ala 










385 










390 










395 












GAC 


ATG 


TAC 


AGC 


TTT 


GGA 


CTC 


ATC 


CTC 


TGG 


GAG 


ATT 


GCA 


AGG 


AGA 


TGT 


1428 


35 


Asp 


Met 


Tyr 


Ser 


Phe 


Gly 


Leu 


lie 


Leu 


Trp 


Glu 


He 


Ala 


Arg 


Arg 


Cys 








400 










405 










410 










GTT 


TCT 


GGA 


GGT 


ATA 


GTG 


GAA 


GAA 


TAC 


CAG 


CTT 


CCC 


TAT 


CAC 


GAC 


CTG 


1476 




Val 


Ser Gly 


Gly 


lie 


Val 


Glu 


Glu 


Tyr 


Gin 


Leu 


Pro 


Tyr 


His 


Asp 


Leu 




40 


415 










420 










425 








430 






GTG 


CCC 


AGT 


GAC 


CCT 


TCT 


TAT 


GAG 


GAC 


ATG 


AGA 


GAA 


ATT 


GTG 


TGC 


ATG 


1524 




Val 


Pro 


Ser 


Asp 


Pro 


Ser 


Tyr 


Glu 


Asp 


Met 


Arg 


Glu 


lie 


Val 


Cys 


Met 




45 










435 










440 










445 






AAG 


AAG 


TTA 


CGG 


CCT 


TCA 


TTC 


CCC 


AAT 


CGA 


TGG 


AGC 


AGT 


GAT 


GAG 


TGT 


1572 




Lys 


Lys 


Leu 


Arg 


Pro 


Ser 


Phe 


Pro 


Asn 


Arg 


Trp 


Ser 


Ser 


Asp 


Glu 


Cys 












450 










455 










460 






50 


CTC 


AGG 


CAG 


ATG 


GGG 


AAG 


CTT 


ATG 


ACA 


GAG 


TGC 


TGG 


GCG 


CAG 


AAT 


CCT 


1620 




Leu 


Arg 


Gin 


Met 


Gly 


Lys 


Leu 


Met 


Thr 


Glu 


Cys 


Trp 


Ala 


Gin 


Asn 


Pro 










465 










470 










475 












GCC 


TCC 


AGG 


CTG 


ACG 


GCC 


CTG 


AGA 


GTT 


AAG 


AAA 


ACC 


CTT 


GCC 


AAA 


ATG 


1668 


55 


Ala 


Ser 


Arg 


Leu 


Thr 


Ala 


Leu 


Arg 


Val 


Lys 


Lys 


Thr 


Leu 


Ala 


Lys 


Met 








480 










485 










490 














TCA 


GAG 


TCC 


CAG 


GAC 


ATT 


AAA 


CTC 


TGACGTCAGA TACTTGTGGA CAGAGCAAGA 


1722 




Ser 


Glu 


Ser 


Gin 


Asp 


lie 


Lys 


Leu 





















495 500 



60 
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5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATTTCACAGA AGCATCGTTA GCCCAAGCCT TGAACGTTAG CCTACTGCCC AGTGAGTTCA 
GACTTTCCTG GAAGAGAGCA CGGTGGGCAG ACACAGAGGA ACCCAGAAAC ACGGATTCAT 
CATGGCTTTC TGAGGAGGAG AAACTGTTTG GGTAACTTGT TCAAGATATG ATGCATGTTG 
CTTTCTAAGA AAGCCCTGTA TTTTGAATTA CCATTTTTTT ATAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) i 
(xi) i 


MOLECULE 

SEQUENCE 


TYPE: protein 
DESCRIPTION: SEQ ID 


NO: 8: 






Met Leu 
1 


Leu 


Arg 


Ser 

5 


Ser 


Gly Lys Leu 


Asn 

10 


Val 


Gly Thr 


Lys Lys 
15 


Glu 


Asp Gly 


Glu 


Ser 

20 


Thr 


Ala 


Pro 


Thr Pro 
25 


Arg 


Pro 


Lys lie 


Leu Arg 
30 


Cys 


Lys Cys 


His 

35 


His 


His 


Cys 


Pro 


Glu Asp 
40 


Ser 


Val 


Asn Asn 
45 


lie Cys 


Ser 


Thr Asp 
50 


Gly 


Tyr 


Cys 


Phe 


Thr 

55 


Met lie 


Glu 


Glu 


Asp Asp 
60 


Ser Gly 


Met 


Pro Val 
65 


Val 


Thr 


Ser 


Gly 

70 


Cys 


Leu Gly 


Leu 


Glu 

75 


Gly Ser 


Asp Phe 


Gin 

80 


Cys Arg 


Asp 


Thr 


Pro 

85 


lie 


Pro 


His Gin 


Arg 

90 


Arg 


Ser lie 


Glu Cys 
95 


Cys 


Thr Glu 


Arg 


Asn 

1Q0 


Glu 


Cys 


Asn 


Lys Asp 
105 


Leu 


His 


Pro Thr 


Leu Pro 
110 


Pro 


Leu Lys 


Asp 

115 


Arg 


Asp 


Phe 


Val 


Asp Gly 
120 


Pro 


lie 


His His 
125 


Lys Ala 


Leu 


Leu lie 
130 


Ser 


Val 


Thr 


Val 


Cys 

135 


Ser Leu 


Leu 


Leu 


Val Leu 
140 


lie lie 


Leu 


Phe Cys 
145 


Tyr 


Phe 


Arg 


Tyr 

150 


Lys Arg Gin 


Glu 


Ala 

155 


Arg Pro 


Arg Tyr 


Ser 

160 


He Gly 


Leu 


Glu 


Gin 

165 


Asp 


Glu 


Thr Tyr 


lie 

170 


Pro 


Pro Gly 


Glu Ser 
175 


Leu 


Arg Asp 


Leu 


lie 

180 


Glu 


Gin 


Ser 


Gin Ser 
185 


Ser 


Gly 


Ser Gly 


Ser Gly 
190 


Leu 


Pro Leu 


Leu 

195 


Val 


Gin 


Arg 


Thr 


lie Ala 
200 


Lys 


Gin 


lie Gin 
205 


Met Val 


Lys 



1782 

1842 

1902 

1952 
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Gin 


He 

210 


Gly 


Lys 


Gly Arg 


Tyr 

215 


Gly 


Glu 


Val 


Trp 


Met 

220 


Gly 


Lys 


Trp 


Arg 


5 


Gly 

225 


Glu 


Lys 


Val 


Ala Val 
230 


Lys 


Val 


Phe 


Phe 


Thr 

235 


Thr 


Glu 


Glu 


Ala 


Ser 

240 




Trp 


Phe 


Arg 


Glu 


Thr Glu 
245 


lie 


Tyr 


Gin 


Thr 

250 


Val 


Leu 


Met 


Arg 


His 

255 


Glu 


10 


Asn 


lie 


Leu 


Gly 

260 


Phe lie 


Ala 


Ala 


Asp 

265 


lie 


Lys 


Gly 


Thr. 


Gly 

270 


Ser 


Trp 




Thr 


Gin 


Leu 

275 


Tyr 


Leu lie 


Thr 


Asp 

280 


Tyr 


His 


Glu 


Asn 


Gly 

285 


Ser 


Leu 


Tyr 


15 




Asp 


Tyr 

290 


Leu 


Lys 


Ser Thr 


Thr 

295 


Leu 


Asp 


Ala 


Lys 


Ser 

300 


Met 


Leu 


Lys 


Leu 


20 


Ala 

305 


Tyr 


Ser 


Ser 


Val Ser 
310 


Gly 


Leu 


Cys 


His 


Leu 

315 


His 


Thr 


Glu 


lie 


Phe 

320 




Ser 


Thr 


Gin 


Gly 


Lys Pro 
325 


Ala 


lie 


Ala 


His 

330 


Arg 


Asp 


Leu 


Lys 


Ser 

335 


Lys 


25 


Asn 


lie 


Leu 


Val 

340 


Lys Lys 


Asn 


Gly 


Thr 

345 


Cys 


Cys 


lie 


Ala 


Asp 

350 


Leu 


Gly 


30 


Leu 


Ala 


Val 

355 


Lys 


Phe lie 


Ser 


Asp 

360 


Thr 


Asn 


Glu 


Val 


Asp 

365 


lie 


Pro 


Pro 




Asn 


Thr 

370 


Arg 


Val 


Gly Thr 


Lys 

375 


Arg 


Tyr 


Met 


Pro 


Pro 

380 


Glu 


Val 


Leu 


Asp 


35 


Glu 

385 


Ser 


Leu 


Asn 


Arg Asn 
390 


His 


Phe 


Gin 


Ser 


Tyr 

395 


lie 


Met 


Ala 


Asp 


Met 

400 




Tyr 


Ser 


Phe 


Gly 


Leu lie 
405 


Leu 


Trp 


Glu 


lie 

410 


Ala 


Arg 


Arg 


Cys 


Val 

415 


Ser 


40 


Gly 


Gly 


lie 


Val 

420 


Glu Glu 


Tyr 


Gin 


Leu 

425 


Pro 


Tyr 


His 


Asp 


Leu 

430 


Val 


Pro 




Ser 


Asp 


Pro 

435 


Ser 


Tyr Glu 


Asp 


Met 

440 


Arg 


Glu 


lie 


Val 


Cys 

445 


Met 


Lys 


Lys 


45 




Leu 


Arg 

450 


Pro 


Ser 


Phe Pro 


Asn 

455 


Arg 


Trp 


Ser 


Ser 


Asp 

460 


Glu 


Cys 


Leu 


Arg 


50 


Gin 

465 


Met 


Gly 


Lys 


Leu Met 
470 


Thr 


Glu 


Cys 


Trp 


Ala 

475 


Gin 


Asn 


Pro 


Ala 


Ser 

480 




Arg 


Leu 


Thr 


Ala 


Leu Arg 
485 


Val 


Lys 


Lys 


Thr 

490 


Leu 


Ala 


Lys 


Met 


Ser 

495 


Glu 



55 Ser Gin Asp lie Lys Leu 
500 



(2) INFORMATION FOR SEQ ID NO: 9: 



60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1822 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



10 



15 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 1341 

(D) OTHER INFORMATION: /function^ "OSTEOGENIC PROTEIN" 
/product= "OP1" 

/evidence- EXPERIMENTAL 
/standard_name= "OPl" 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



20 GGTGCGGGCC CGGAGCCCGG AGCCCGGGTA GCGCGTAGAG CCGGCGCG ATG CAC GTG 57 

Met His Val 
1 





CGC 


TCA 


CTG 


CGA 


GCT 


GCG 


GCG 


CCG 


CAC 


AGC 


TTC 


GTG 


GCG 


CTC 


TGG 


GCA 


105 


25 


Arg 


Ser 


Leu 


Arg 


Ala 


Ala 


Ala 


Pro 


His 


Ser 


Phe 


Val 


Ala 


Leu 


Trp 


Ala 








5 










10 










15 












CCC 


CTG 


TTC 


CTG 


CTG 


CGC 


TCC 


GCC 


CTG 


GCC 


GAC 


TTC 


AGC 


CTG 


GAC 


AAC 


153 




Pro 


Leu 


Phe 


Leu 


Leu 


Arg 


Ser 


Ala 


Leu 


Ala 


Asp 


Phe 


Ser 


Leu 


Asp 


Asn 




30 


20 










25 










30 








35 






GAG 


GTG 


CAC 


TCG 


AGC 


TTC 


ATC 


CAC 


CGG 


CGC 


CTC 


CGC 


AGC 


CAG 


GAG 


CGG 


201 




Glu 


Val 


His 


Ser 


Ser 


Phe 


He 


His 


Arg 


Arg 


Leu 


Arg 


Ser 


Gin 


Glu 


Arg 




35 










40 










45 










50 






CGG 


GAG 


ATG 


CAG 


CGC 


GAG 


ATC 


CTC 


TCC 


ATT 


TTG GGC 


TTG 


CCC 


CAC 


CGC 


249 




Arg 


Glu 


Met 


Gin 


Arg 


Glu 


lie 


Leu 


Ser 


lie 


Leu 


Gly 


Leu 


Pro 


His 


Arg 












55 










60 








65 






40 


CCG 


CGC 


CCG 


CAC 


CTC 


CAG 


GGC 


AAG 


CAC 


AAC 


TCG 


GCA 


CCC 


ATG 


TTC 


ATG 


297 




Pro 


Arg 


Pro 


His 


Leu 


Gin 


Gly 


Lys 


His 


Asn 


Ser 


Ala 


Pro 


Met 


Phe 


Met 










70 










75 










80 












CTG 


GAC 


CTG 


TAC 


AAC 


GCC 


ATG 


GCG 


GTG 


GAG 


GAG 


GGC 


GGC 


GGG 


CCC 


GGC 


345 


45 


Leu 


Asp 


Leu 


Tyr 


Asn 


Ala 


Met 


Ala 


Val 


Glu 


Glu 


Gly 


Gly 


Gly 


Pro 


Gly 








85 










90 










95 










GGC 


CAG 


GGC 


TTC 


TCC 


TAC 


CCC 


TAC 


AAG 


GCC 


GTC 


TTC 


AGT 


ACC 


CAG 


GGC 


393 




Gly 


Gin 


Gly 


Phe 


Ser 


Tyr 


Pro 


Tyr 


Lys 


Ala 


Val 


Phe 


Ser 


Thr 


Gin 


Gly 




50 


100 










105 










110 










115 






CCC 


CCT 


CTG 


GCC 


AGC 


CTG 


CAA 


GAT 


AGC 


CAT 


TTC 


CTC 


ACC 


GAC 


GCC 


GAC 


441 




Pro 


Pro 


Leu 


Ala 


Ser 


Leu 


Gin 


Asp 


Ser 


His 


Phe 


Leu 


Thr 


Asp 


Ala 


Asp 




55 










120 










125 










130 




ATG 


GTC 


ATG 


AGC 


TTC 


GTC 


AAC 


CTC 


GTG 


GAA 


CAT 


GAC 


AAG 


GAA 


TTC 


TTC 


489 




Met 


Val. 


Met 


Ser 


Phe 


Val 


Asn 


Leu 


Val 


Glu 


His 


Asp 


Lys 


Glu 


Phe 


Phe 












135 










140 










145 








60 


CAC 


CCA 


CGC 


TAC 


CAC 


CAT 


CGA 


GAG 


TTC 


CGG 


TTT 


GAT 


CTT 


TCC 


AAG 


ATC 


537 
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55 



His 


Pro 


Arg 

150 


Tyr 


His 


His Arg 


Glu 

155 


Phe 


Arg 


Phe 


Asp 


Leu 

160 


Ser 


Lys 


lie 




CCA 


GAA 


GGG 


GAA 


GCT 


GTC ACG 


GCA 


GCC 


GAA 


TTC 


CGG 


ATC 


TAC 


AAG 


GAC 


585 


Pro 


Glu 

165 


Gly 


Glu 


Ala 


Val Thr 
170 


Ala 


Ala 


Glu 


Phe 


Arg 

175 


lie 


Tyr 


Lys 


Asp 




TAC 


ATC 


CGG 


GAA 


CGC 


TTC GAC 


AAT 


GAG 


ACG 


TTC 


CGG 


ATC 


AGC 


GTT 


TAT 


633 


Tyr 

180 


He 


Arg 


Glu 


Arg 


Phe Asp 
185 


Asn 


Glu 


Thr 


Phe 

190 


Arg 


lie 


Ser 


Val 


Tyr 

195 




CAG 


GTG 


CTC 


CAG 


GAG 


CAC TTG 


GGC 


AGG 


GAA 


TCG 


GAT 


CTC 


TTC 


CTG 


CTC 


681 


Gin 


Val 


Leu 


Gin 


Glu 

200 


His Leu 


Gly 


Arg 


Glu 

205 


Ser 


Asp 


Leu 


Phe 


Leu 

210 


Leu 




GAC 


AGC 


CGT 


ACC 


CTC 


TGG GCC 


TCG 


GAG 


GAG 


GGC 


TGG 


CTG 


GTG 


TTT 


GAC 


729 


Asp 


Ser 


Arg 


Thr 

215 


Leu 


Trp Ala 


Ser 


Glu 

220 


Glu Gly 


Trp 


Leu 


Val 

225 


Phe 


Asp 




ATC 


ACA 


GCC 


ACC 


AGC 


AAC CAC 


TGG 


GTG 


GTC 


AAT 


CCG 


CGG 


CAC 


AAC 


CTG 


777 


lie 


Thr 


Ala 

230 


Thr 


Ser 


Asn His 


Trp 

235 


Val 


Val 


Asn 


Pro 


Arg 

240 


His 


Asn 


Leu 




GGC 


CTG 


CAG 


CTC 


TCG 


GTG GAG 


ACG 


CTG 


GAT 


GGG 


CAG 


AGC 


ATC 


AAC 


CCC 


825 


Gly 


Leu 

245 


Gin 


Leu 


Ser 


Val Glu 
250 


Thr 


Leu 


Asp Gly 


Gin 

255 


Ser 


lie 


Asn 


Pro 




AAG 


TTG 


GCG 


GGC 


CTG 


ATT GGG 


CGG 


CAC 


GGG 


CCC 


CAG 


AAC 


AAG 


CAG 


CCC 


873 


Lys 

260 


Leu 


Ala 


Gly Leu 


lie Gly 
265 


Arg 


His 


Gly 


Pro 

270 


Gin 


Asn 


Lys 


Gin 


Pro 

275 




TTC 


ATG 


GTG 


GCT 


TTC 


TTC AAG 


GCC 


ACG 


GAG 


GTC 


CAC 


TTC 


CGC 


AGC 


ATC 


921 


Phe 


Met 


Val 


Ala 


Phe 

280 


Phe Lys 


Ala 


Thr 


Glu 

285 


Val 


His 


Phe 


Arg 


Ser 

290 


lie 




CGG 


TCC 


ACG 


GGG 


AGC 


AAA CAG 


CGC 


AGC 


CAG 


AAC 


CGC 


TCC 


AAG 


ACG 


CCC 


.969 


Arg 


Ser 


Thr 


Gly .Ser 
295 


Lys Gin 


Arg 


Ser 

300 


Gin 


Asn Arg 


Ser 


Lys 

305 


Thr 


Pro 




AAG 


AAC 


CAG 


GAA 


GCC 


CTG CGG 


ATG 


GCC 


AAC 


GTG 


GCA 


GAG 


AAC 


AGC 


AGC 


1017 


Lys 


Asn 


Gin 

310 


Glu 


Ala 


Leu Arg 


Met 

315 


Ala 


Asn 


Val 


Ala 


Glu 

320 


Asn 


Ser 


Ser 




AGC 


GAC 


CAG 


AGG 


CAG 


GCC TGT 


AAG 


AAG 


CAC. GAG 


CTG 


TAT 


GTC 


AGC 


TTC 


1065 


Ser 


Asp 

325 


Gin 


Arg Gin 


Ala Cys 
330 


Lys 


Lys 


His 


Glu 


Leu 

335 


Tyr 


Val 


Ser 


Phe 




CGA 


GAC 


CTG 


GGC 


TGG 


CAG GAC 


TGG 


ATC 


ATC 


GCG 


CCT 


GAA 


GGC 


TAC 


GCC 


1113 


Arg 

340 


Asp 


Leu 


Gly Trp 


Gin Asp 
345 


Trp 


lie 


He 


Ala 

350 


Pro 


Glu 


Gly Tyr 


Ala 

355 




GCC 


TAC 


TAC 


TGT 


GAG 


GGG GAG 


TGT 


GCC 


TTC 


CCT 


CTG 


AAC 


TCC 


TAC 


ATG 


1161 


Ala 


Tyr 


Tyr 


Cys 


Glu 

360 


Gly Glu 


Cys 


Ala 


Phe 

365 


Pro 


Leu 


Asn 


Ser 


Tyr 

370 


Met 




AAC 


GCC 


ACC 


AAC 


CAC 


GCC ATC 


GTG 


CAG 


ACG 


CTG 


GTC 


CAC 


TTC 


ATC 


AAC 


1209 


Asn 


Ala 


Thr 


Asn 

375 


His 


Ala lie 


Val 


Gin 

380 


Thr 


Leu 


Val 


His 


Phe 

385 


lie 


Asn 




CCG 


GAA 


ACG 


GTG. 


CCC 


AAG CCC 


TGC 


TGT 


GCG 


CCC 


ACG 


CAG 


CTC 


AAT 


GCC 


1257 
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Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro 
390 395 

ATC TCC GTC CTC TAC TTC GAT GAC AGC TCC AAC 
lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn 
405 410 



TAC AGA AAC ATG GTG GTC CGG GCC TGT GGC TGC 
Tyr Arg Asn Met Val Val Arc Ala Cys Gly Cys 
420 425 430 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT 

G AAC C AGC AG ACCAACT GCC TTTTGTGAGA CCTTCCCCTC 

TGTGAGAGTA TTAGGAAACA TGAGCAGCAT ATGGCTTTTG 

ATCCAATGAA CAAGATCCTA CAAGCTGTGC AGGCAAAACC 

GCATAAAGAA AAATGGCCGG GCCAGGTCAT TGGCTGGGAA 

CGTTTCCAGA GGTAATTATG AGCGCCTACC AGCCAGGCCA 

GGCGTGGCAA GGGGTGGGCA CATTGGTGTC TGTGCGAAAG 

CTGTAATAAA TGTCACAATA AAACGAATGA ATGAAAAAAA 



Thr Gin Leu Asn Ala 
400 

GTC ATC CTG AAG AAA 
Val lie Leu Lys Lys 
415 

CAC TAGCTCCTCC 
His 

CCATTGCTCG CCTTGGCCAG 
CCTATCCCCA ACTTTAAAGG 
ATCAGTTTTT CAGTGGCAGC 
T AGC AG G AAA AAAAAACAAC 
GTCTCAGCCA TGCACGGACT 
CCCAGCCGTG GGAGGAAGGG 
GAAAATTGAC CCGGAAGTTC 
AAAAAAAAAA A 



(2) INFORMATION FOR SEQ ID NO: 10: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



40 


Met 

1 


His Val 


Arg 


Ser 

5 


Leu Arg Ala 


Ala 


Ala 

10 


Pro 


His 


Ser 


Phe 


Val 

15 


Ala 


45 


Leu 


Trp Ala 


Pro 

20 


Leu 


Phe Leu Leu 


Arg 

25 


Ser 


Ala 


Leu 


Ala 


Asp 

30 


Phe 


Ser 




Leu 


Asp Asn 
35 


Glu 


Val 


His Ser Ser 
40 


Phe 


He 


His 


Arg 


Arg 

45 


Leu 


Arg 


Ser 


50 


Gin 


Glu Arg 
50 


Arg 


Glu 


Met Gin Arg 
55 


Glu 


lie 


Leu 


Ser 

60 


lie 


Leu 


Gly 


Leu 




Pro 

65 


His Arg 


Pro 


Arg 


Pro His Leu 
70 


Gin 


Gly Lys 
75 


His 


Asn 


Ser 


Ala 


Pro 

80 


55 


Met 


Phe Met 


Leu 


Asp 

85 


Leu Tyr Asn 


Ala 


Met 

90 


Ala 


Val 


Glu 


Glu 


Gly 

95 


Gly 




Gly 


Pro Gly Gly Gin Gly Phe Ser 
100 


Tyr 

105 


Pro 


Tyr 


Lys 


Ala 


Val 

110 


Phe 


Ser 



1305 

1351 

1411 

1471 

1531. 

1591 

1651 

1711 

1771 

1822 
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Thr 


Gin 


Gly 

115 


Pro 


Pro 


Leu 


Ala 


Ser 

120 


Leu 


Gin 


Asp 


Ser 


His 

125 


Phe 


Leu 


Thr 


Asp 


Ala 

130 


Asp 


Met 


Val 


Met 


Ser 

135 


Phe 


Val 


Asn 


Leu 


Val 

140 


Glu 


His 


Asp 


Lys 


Glu 

145 


Phe 


Phe 


His 


Pro 


Arg 

150 


Tyr 


His 


His 


Arg 


Glu 

155 


Phe 


Arg 


Phe 


Asp 


Leu 

160 


Ser 


Lys 


lie 


Pro 


Glu 

165 


Gly 


Glu 


Ala 


Val 


Thr 

170 


Ala 


Ala 


Glu 


Phe 


Arg 

175 


lie 


Tyr 


Lys 


Asp 


Tyr 

180 


lie 


Arg 


Glu 


Arg 


Phe 

185 


Asp 


Asn 


Glu 


Thr 


Phe 

190 


Arg 


lie 


Ser 


Val 


Tyr 

195 


Gin 


Val 


Leu 


Gin 


Glu 

200 


His 


Leu 


Gly 


Arg 


Glu 

205 


Ser 


Asp 


Leu 


Phe 


Leu 

210 


Leu 


Asp 


Ser 


Arg 


Thr 

215 


Leu 


Trp 


Ala 


Ser 


Glu 

220 


Glu 


Gly 


Trp 


Leu 


Val 

225 


Phe 


Asp 


lie 


Thr 


Ala 

230 


Thr 


Ser 


Asn 


His 


Trp 

235 


Val 


Val. 


Asn 


Pro 


Arg 

240 


His 


Asn 


Leu 


Gly 


Leu 

245 


Gin 


Leu 


Ser 


Val 


Glu 

250 


Thr 


Leu 


Asp Gly 


Gin 

255 


Ser 


lie 


Asn 


Pro 


Lys 

260 


Leu 


Ala 


Gly 


Leu 


lie 

265 


Gly 


Arg 


His 


Gly 


Pro 

270 


Gin 


Asn 


Lys 


Gin 


Pro 

275 


Phe 


Met 


Val 


Ala 


Phe 

280 


Phe 


Lys 


Ala 


Thr 


Glu 

285 


Val 


His 


Phe 


Arg 


Ser 

290 


lie 


Arg 


Ser 


Thr 


Gly 

295 


Ser 


Lys 


Gin 


Arg 


Ser 

300 


Gin 


Asn 


Arg 


Ser 


Lys 

305 


Thr 


Pro 


Lys 


Asn 


Gin 

310 


Glu 


Ala 


Leu 


Arg 


Met 

315 


Ala 


Asn 


Val 


Ala 


Glu 

320 


Asn 


Ser 


Ser 


Ser 


Asp 

325 


Gin 


Arg 


Gin 


Ala 


Cys 

330 


Lys 


Lys 


His 


Glu 


Leu 

335 


Tyr 


Val 


Ser 


Phe 


Arg 

340 


Asp 


Leu 


Gly 


Trp 


Gin 

345 


Asp 


Trp 


lie 


lie 


Ala 

350 


Pro 


Glu 


Gly 


Tyr 


Ala 

355 


Ala 


Tyr 


Tyr 


Cys 


Glu 

360 


Gly 


Glu 


Cys 


Ala 


Phe 

365 


Pro 


Leu 


Asn 


Ser 


Tyr 

370 


Met 


Asn 


Ala 


Thr 


Asn 

375 


His 


Ala 


lie 


Val 


Gin 

380 


Thr 


Leu 


Val 


His 


Phe 

385 


He 


Asn 


Pro 


Glu 


Thr 

390 


Val 


Pro 


Lys 


Pro 


Cys 

395 


Cys 


Ala 


Pro 


Thr 


Gin 

400 


Leu 


Asn 


Ala 


lie 


Ser 

405 


Val 


Leu 


Tyr 


Phe 


Asp 

410 


Asp 


Ser 


Ser 


Asn 


Val 

415 


lie 


Leu 


Lys 


Lys 


Tyr 

420 


Arg 


Asn 


Met 


Val 


Val 

425 


Arg 


Ala 


Cys 


Gly Cys 
430 


His 


* 



60 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..102 

(D) OTHER INFORMATION: /label= OPX 

15 /note= "Each Xaa is independently selected from 

a group of one or more specified amino acids 
as defined in the specification" 



20 



25 



30 



35 



40 



45 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

Cys Xaa Xaa His Glu Leu Tyr Val Xaa Phe Xaa Asp Leu Gly Trp Xaa 
15 10 15 

Asp Trp Xaa lie Ala Pro Xaa Gly Tyr Xaa Ala Tyr Tyr Cys Glu Gly 
20 25 30 

Glu Cys Xaa Phe Pro Leu Xaa Ser Xaa Met Asn Ala Thr Asn His Ala 
35 40 45 

He Xaa Gin Xaa Leu Val His Xaa Xaa Xaa Pro Xaa Xaa Val Pro Lys 
50 55 60 

Xaa Cys Cys Ala Pro Thr Xaa Leu Xaa Ala Xaa Ser Val Leu Tyr Xaa 

65 70 75 80 

Asp Xaa Ser Xaa Asn Val Xaa Leu Xaa Lys Xaa Arg Asn Met Val Val 

85 90 95 

Xaa Ala Cys Gly Cys His 
100 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



60 
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GCGGATCCTG TTGTGAAGGN AATATGTG 28 

(2) INFORMATION FOR SEQ ID NO: 13: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

20 

GCGATCCGTC GCAGTCAAAA TTTT 24 

(2) INFORMATION FOR SEQ ID NO: 14: 

25 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
30 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

40 

GCGGATCCGC GATATATTAA AAGCAA 26 

(2) INFORMATION FOR SEQ ID NO: 15: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
50 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
55 (iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 15: 



60 



CGGAATTCTG GTGCCATATA 
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What is claimed is: 
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A method for identifying an OP-1 receptor-binding analog, said 
analog being characterized as having substantially the same 
binding affinity for a cell surface receptor as OP-1, the method 
comprising the steps of: 

(a) providing a sample containing a protein selected from the group 
consisting of: 

(i) a polypeptide chain comprising an amino acid sequence 
defined by residues 16-123 of Seq. ID No. 3 (ALK-2 ) , or an 
OPl-binding analog thereof; 

(ii) a polypeptide chain comprising an amino acid sequence 
defined by residues 24-152 of Seq. ID No. 5 (ALK-3 ) , , or an 
OPl-binding analog thereof; 

(iii) -a polypeptide chain comprising an amino acid sequence 
defined by residues 23-122 of Seq. ID No. 7 (ALK-6 ) , , or an 
OP1 binding analog thereof; 

(iv) a polypeptide chain having binding affinity for OP-1 and 
sharing at least 40% amino acid identity with residues 23- 
122 of Seq. ID No. 7 (ALK-6),; 

(v) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid obtainable by amplification with 
one or more primer sequences defined by Seq. ID Nos. 12-15; 
or 

(vi) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid that hybridizes under stringent 
conditions with a nucleic acid comprising the sequence 
defined by nucleotides 256-552 of Seq. ID No. 7 (ALK-6),; 

(b) contacting said sample with a candidate OPl receptor- binding 
analog; and 

(c) detecting specific binding between said candidate OPl receptor- 
binding analog and said protein. 

A method for identifying an OP-1 receptor-binding analog, said 
analog being characterized as having substantially the same 
binding affinity for a cell surface receptor as OPl, the method 
comprising the steps of: 
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15 



16 



(a) providing a cell that expresses a surface receptor protein having 
binding specificity for OP-1 selected from the group consisting 
of : 

(i) a polypeptide chain comprising an amino acid sequence 

defined by residues 16-123 of Seq. ID No. 3 (ALK-2 ) , or an 
OPl-binding analog thereof; 

(ii) a polypeptide chain comprising an amino acid sequence 

defined by residues 24-152 of Seq. ID No. 5 (ALK-3 ) , , or an 
OPl-binding analog thereof; 

(iii) a polypeptide chain comprising an amino acid sequence 

defined by residues 23-122 of Seq. ID No. 7 (ALK-6),, or an 
OPl binding analog thereof; 



17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

1 3. 

2 

3 

4 

1 4. 

2 
3 

1 5. 

2 
3 



(iv) a polypeptide chain having binding affinity for OP-1 and 
sharing at least 40% amino acid identity with residues 23- 
122 of Seq. ID No. 7 (ALK-6),; 

(v) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid obtainable by amplification with 
one or more primer sequences defined by Seq. ID Nos. 12-15; 
or 

(vi) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid that hybridizes under stringent 
conditions with a nucleic acid comprising the sequence 
defined by nucleotides 256-552 of Seq. ID No. 7 (ALK-6),; 

(b) contacting said cell with a candidate OPl receptor-binding 
analog; and 

(c) detecting induction of an OP-l-mediated cellular response. 

The method of claim 2 wherein said OP-1 mediated cellular response 
detected in step (c) is induction of a kinase activity, inhibition of 
epithelial cell growth, or induction of a cell differentiation 
marker . 

The method of claim 2 or 3 wherein said cell comprises a transfected 
nucleic acid comprising a reporter gene in operative association with a 
control element derived from an OP-1 inducible protein. 

The method of any of claims 1-4 wherein said sample further comprises 
part or all of a Type II serine/threonine kinase receptor protein 
having binding affinity for OP-1, activin or BMP-4. 
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1 6, A method for producing an OP-1 receptor binding analog, the method 

2 comprising the steps of: 

3 (a) obtaining, by the method of any of claims 1-5, a candidate OP-1 

4 binding analog, and 
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(b) producing either said candidate analog or a second OP-1 binding 

analog derived from said candidate and having substantially the 

same OP-1 receptor-binding domain as said candidate. 

The method of producing an OP-1 receptor-binding analog of claim 6 
wherein said analog produced in step (b) is by recombinant DNA 
techniques, or by nonbiological peptide synthesis. 

A kit for identifying OP-1 or a candidate OP-1 receptor binding analog 
in a sample, the kit comprising: 

(a) a receptacle adapted to receive a sample and containing a protein 
selected from the group consisting of : 

(i) a polypeptide chain comprising an amino acid sequence 
defined by residues 16-123 of Seq. ID No. 3 (ALK-2) , or an 
OPl-binding analog thereof; 

(ii) a polypeptide chain comprising an amino acid sequence 
defined by residues 24-152 of Seq. ID No. 5 ( ALK-3 ) , , or an 
OPl-binding analog thereof; 

(iii) a polypeptide chain comprising an amino acid sequence 
defined by residues 23-122 of Seq. ID No. 7 (ALK-6) , , or an 

• OP1 binding analog thereof; 

(iv) a polypeptide chain having binding affinity for OP-1 and 
sharing at least 40% amino acid identity with residues 23- 
122 of Seq. ID No. 7 (ALK-6),; 

(v) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid obtainable by amplification with 
one or more primer sequences defined by Seq. ID Nos. 12-15; 
or 

(vi) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid that hybridizes under stringent 
conditions with a nucleic acid comprising the sequence 
defined by nucleotides 256-552 of Seq. ID No. 7 (ALK-6),; 
and 

(b) means for detecting interaction of OP-1 or a candidate OP-1 
receptor-binding analog with said protein of part (a) , said OP-1 
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or candidate analog comprising part of said sample provided to 
said receptacle. 

The kit of claim 8 wherein said means in part (b) comprises either 

(i) means for detecting specific binding interaction of OP-1 
or said candidate analog with said protein; or 

(ii) means for detecting induction of an OP-1 mediated cellular 
response. 

The kit of claim 8 or 9 further comprising a serine/ threonine Type II 
receptor having binding specificity for OP-1, activin or BMP-4. 

An OP-1 receptor-binding analog produced by the method of any of claims 
1-7 or use of the kit of claims 8-10. 

The analog produced by the method of any of claims 1-8 , said analog 

(i) comprising an amino acid sequence sharing greater than 60% 
identity with the C-terminal 96 amino acids of the sequence 
represented by Seq. ID No. 9 (OP-1, residues 335-431), and 

(ii) being substantially incapable of inducing an OP-1 mediated 
cellular response. 

The analog of claim 11 or 12 further having binding affinity for a 
Type II serine/threonine kinase cell surface receptor. 

The analog of claim 13 wherein said Type II receptor also has binding 
affinity for activin or BMP-4. 

An isolated ligand- receptor complex comprising two molecules 
interacting as specific binding partners, the first said molecule 
defining said ligand and comprising at least the C-terminal 96 amino 
acids of OP1 (residues 335-431 of Seq ID No. 9) or a receptor-binding 
analog thereof, and the second said molecule defining said receptor and 
being selected from the group consisting of: 

(i) a polypeptide chain comprising an amino acid sequence 

defined by residues 16-123 of Seq. ID No. 3 (ALK-2 ) , or an 
OPl-binding analog thereof; 

(ii) a polypeptide chain comprising an amino acid sequence 

defined by residues 24-152 of Seq. ID No. 5 (ALK-3) , , or an 
OPl-binding analog thereof; 

(iii) a polypeptide chain comprising an amino acid sequence 
defined by residues 23-122 of Seq. ID No. 7 (ALK-6) , , 

OP1 binding analog thereof; 
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(iv) a polypeptide chain having binding affinity for OP-1 and 
sharing at least 40% amino acid identity with residues 23- 
122 of Seg. ID No. 7 (ALK-6 ) , ; 

(v) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid obtainable by amplification with 
one or more primer sequences defined by Seq. ID Nos. 12-15; 
or 

(vi) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid that hybridizes under stringent 
conditions with a nucleic acid comprising the sequence 
defined by nucleotides 256-552 of Seq. ID No. 7 (ALK-6) , . 

16. The complex of claim 15 further comprising part or all of a Type II 
serine/threonine kinase receptor . 

17. The complex of claim 16 wherein said Type II receptor also has binding 
affinity for activin or BMP-4. 

18. The complex of any of claims 15-17 wherein said first molecule defining 
said ligand is an OP-1 receptor-binding analog comprises part or all 
of the proteins selected from the group consisting of 60A, BMP-5, BMP- 
6, Vgr-1, OP2, OP3 and receptor-binding amino acid sequence variants or 
xenogenic homologs thereof. 

19. An isolated binding partner having specific binding affinity for an 

epitope on a ligand-receptor complex, said complex being characterized 
as comprising an OP-1 protein or an analog thereof in specific binding 
interaction with the ligand binding domain of a cell surface receptor 
defined by Seq. ID No. 3 (ALK-2) , 5, or 7, or an OPl-binding analog 
thereof; said binding partner having subst anti ally no binding affinity 
for the uncomplexed form of said OP-1 protein or OP-1 protein analog. 
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The isolated binding partner of claim 19 wherein said binding partner 
is further characterized as having substantially no binding affinity 
for the uncomplexed form of said cell surface receptor protein or said 
analog thereof. 

The binding . partner of claim 19 wherein said binding partner is a 
monoclonal or polyclonal antibody. 

Use of the OP-1 receptor-binding analog of any claims 11-14 in a method 

(i) antagonizing OP-1 binding to a cell surface receptor; or 

(ii) antagonizing induction of an OP-1 mediated cellular 
response . 
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The use according to claim 22 wherein said OP-1 receptor-binding analog 
comprises an antibody having binding specificity for 

(i) the ligand binding domain of a cell surface receptor defined 
by Seq. ID Nos. 3, 5, or 7 or an OP-1 binding analog 
thereof; or 

(ii) the receptor binding domain of OP-1, represented by Seq. ID 
No. 9, or a receptor-binding analog thereof. 

Use of a protein selected from the group consisting of: 

(i) 3 polypeptide chain comprising an amino acid sequence 

defined by residues 16-123 of Seq. ID No. 3 (ALK-2), or an 
OP1 -binding analog thereof; 

(ii) a polypeptide chain comprising an amino acid sequence 
defined by residues 24-152 of Seq. ID No. 5 (ALK-3 ) , , or an 
OPl-binding analog thereof; 

(iii) a polypeptide chain comprising an amino acid sequence 
defined by residues 23-122 of Seq. ID No. 7 (ALK-6) , , or an 
OP1 binding analog thereof; 

(iv) a polypeptide chain having binding affinity for OP-1 and 
sharing at least 40% amino acid identity with residues 23- 
122 of Seq. ID No. 7 (ALK-6) , ; 
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(v) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid obtainable by amplification with 
one or more primer sequences defined by Seq. ID Nos. 12-15; 
or 

(vi) a polypeptide chain having binding affinity for OP-1 and 
encoded by a nucleic acid that hybridizes under stringent 
conditions with a nucleic acid comprising the sequence 
defined by nucleotides 256-552 of Seq. ID No. 7 (ALK-6),; 

in a method for antagonizing 

(i) OP-1 binding to a cell surface receptor; or 

(ii) induction of an OP-1 mediated cellular response. 

A method for antagonizing activin binding to a cell surface receptor, 
the method comprising the step of: 

providing a cell expressing a said receptor with a protein having 
binding specificity for the amino acid sequence defined by 
residues 16-123 of Seq ID No. 3 or an OP-1 binding sequence 
variant thereof, said protein sharing at least 60% amino acid 
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sequence identity with residue 335-431 of the sequence defined by 
Seq ID No. 9, 

such that said protein, when provided to said cell, is competent 
tb interact specifically with said receptor, thereby 
substantially inhibiting activin binding to said receptor. 

A method for antagonizing BMP-4 binding to a cell surface receptor, the 
method comprising the step of: 

providing a cell expressing a said receptor with a protein having 
binding specificity for the ligand binding domain defined by. 
residues 24-152. of Seq ID No. 5 (ALK-3) , or residues 23-122 of 
Seq ID No. 7 (ALK-6) , or an OP-1 binding sequence variant 
thereof, said protein sharing at least 60% amino acid sequence 
identity with residues 335-431 of the sequence defined by Seq ID 
No. 9, 

such that said protein, when provided to said cell, is competent 
to interact specifically with said receptor, thereby 
substantially inhibiting BMP-4 binding to said receptor. 

Use of the OP-1 receptor binding analog of claim 12-14 in the method of 
claim 25 or 26 . 
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